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FIGURE 9 

1 AAAACCCCAA AACCCCAAAA CCCCTTTTAG AGCCCTGCAG TTGGAAATAT 
5 1 AACCTCAGTA TTAATAAGCT CAGATTTTAA ATATTAATTA CAAAACCTAA 
101 ATGGAGGTTG ATGTTGATAA TCAAGCTGAT AATCATGGCA TTCACTCAGC 
1 5 1 TCTTAAGACT TGTGAAGAAA TTAAAGAAGC TAAAACGTTG TACTCTTGGA 
20 1 TCCAGAAAGT TATTAGATGA AG AAATC AAT CTCAAAGTCA TTATAAAGAT 
25 1 TTAGAAGATA TTAAAATATT TGCGCAGACA AATATTGTTG CTACTC CACG 
301 AGACTATAAT GAAGAAGATT TTAAAGTTAT TGCAAGAAAA GAAGTAl 1 1 1 
3 5 1 C AACTGGACT AATG ATCGAA CTTATTG AC A AATGCTTAGT TGAACTTCTT 
401 TCATCAAGCG ATGTTTCAGA TAGACAAAAA CTTCAATGAT TTGGATTTCA 
451 ACTTAAGGGA AATCAATTAG CAAAGACCCA TTTATTAACA GCTCTTTCAA 
50 1 CTCAAAAGC A GTATTTCTTT CAAG ACGAAT GGAACCAAGT TAG AGCAATG 
551 ATTGGAAATG AGCTCTTCCG ACATCTCTAC ACTAAATATT TAATATTCCA 
60 1 GCG AACTTCT G AAGGAACTC TTGTTC AATT TTGCGGGAAT AACGTTTTTG 
651 ATCATTTGAA AGTCAACGAT AAGTTTGACA AAAAGCAAAA AGGTGGAGCA 
701 GCAGACATGA ATGAACCTCG ATGTTGATCA ACCTGCAAAT ACAATGTCAA 
75 1 GAATGAGAAA GATCACTTTC TCAACAAC AT CAACGTGCCG AATTGGAATA 
801 ATATGAAATC AAGAACCAGA ATATTTTATT GCACTCATTT TAATAGAAAT 
85 1 AACCAATTCT TCAAAAAGCA TGAGTTTGTG AGTAACAAAA ACAATATTTC 
901 AGCGATGGAC AGAGCTCAGA CGATATTCAC GAATATATTC AGATTTAATA 
95 1 GAATTAGAAA GAAGCTAAAA G ATAAGGTTA TCGAAAAAAT TGCCTACATG 
1001 CTTGAGAAAG TCAAAGATTT TAACTTCAAC TACTATTTAA CAAAATCTTG 
1 05 1 TCCTCTTCC A GAAAATTGGC GGGAACGG AA ACAAAAAATC GAAAACTTGA 
1101 TAAATAAAAC TAGAGAAGAA AAGTCGAAGT ACTATGAAGA GCTGTTTAGC 
1151 TACACAACTG ATAATAAATG CGTCACACAA TTTATTAATG AATTTTTCTA 
1 20 1 CAATATACTC CCCAAAGACT TTTTGACTGG AAGAAACCGT AAGAATTTTC 
125 1 AAAAGAAAGT TAAGAAATAT GTGGAACTAA ACAAGCATGA ACTCATTCAC 
1 30 1 AAAAACTTAT TGCTTGAGAA GATCAATACA AGAGAAATAT CATGGATGCA 
1351 GGTTGAGACC TCTGCAAAGC ATTTTTATTA TTTTGATCAC GAAAACATCT 
1401 ACGTCTTATG G AAATTGCTC CG ATGG ATAT TCG AGGATCT CGTCGTCTCG 
145 1 CTGATTAGAT GATTTTTCTA TGTCACCGAG CAACAGAAAA GTTACTCCAA 
1 501 AACCTATTAC TACAGAAAGA ATATTTGGGA CGTCATTATG AAAATGTCAA 
1 55 1 TCGCAGACTT AAAGAAGGAA ACGCTTGCTG AGGTCCAAGA AAAAGAGGTT 
1 60 1 GAAGAATGGA AAAAGTCGCT TGG ATTTGC A CCTGGAAAAC TC AG ACTAAT 
1 65 1 ACCGAAGAAA ACTACTTTCC GTCCAATTAT G ACTTTC AAT AAGAAGATTG 
1 701 TAAATTCAGA CCGGAAGACT ACAAAATTAA CTACAAATAC GAAGTTATTG 
1 75 1 AACTCTCACT TAATGCTTAA GACATTGAAG AATAGAATGT TTAAAGATCC 
1801 TTTTGGATTC GCTGTTTTTA ACTATGATGA TGTAATGAAA AAGTATGAGG 
1851 AGTTTGTTTG CAAATGGAAG CAAGTTGGAC AACCAAAACT CTTCTTTGCA 
1901 ACTATGGATA TCGAAAAGTG ATATGATAGT GTAAACAGAG AAAAACTATC 
1 95 1 AACATTCCTA AAAACTACTA AATTACTTTC TTCAGATTTC TGGATTATGA 
2001 CTGCACAAAT TCTAAAGAGA AAGAATAACA TAGTTATCGA TTCGAAAAAC 
2051 TTTAGAAAGA AAGAAATGAA AGATTATTTT AGACAGAAAT TCCAGAAGAT 
2101 TGCACTTGAA GGAGGACAAT ATCCAACCTT ATTCAGTGTT CTTGAAAATG 
2151 AACAAAATGA CTTAAATGCA AAGAA AACAT TAATTGTTGA AGCAAAGCAA 
220 1 AG AAATTATT TTAAGAAAG A TAACTTACTT C AACC AGTC A TTAATATTTG 
225 1 CCAATATAAT TACATTAACT TTAATGGGA A GTTTTATAAA CAAACAAAAG 
2301 GAATTCCTCA AGGTCTTTGA GTTTCATCAA TTTTGTCATC ATTTTATTAT 
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FIGURE 9 (cont) 



2351 GCAACATTAG AGGAAAGCTC CTTAGGATTC CTTAGAGATG AATCAATGAA 
2401 CCCTGAAAAT CCAAATGTTA ATCTTCTAAT GAGACTTACA GATGACTATC 
2451 TTTTGATTAC AACTCAAGAG AATAATGCAG TATTGTTTAT TGAGAAACTT 
2501 ATAAACGTAA GTCGTGAAAA TGGATTTAAA TTCAATATGA AGAAACTACA 
2551 GACTAGTTTT CCATTAAGTC C AAGCAAATT TGCAAAATAC GGAATGGATA 
2601 GTGTTGAGGA GCAAAATATT GTTCAAGATT ACTGCGATTG GATTGGCATC 
2651 TCAATTGATA TGAAAACTCT TGCTTTAATG CCAAATATTA ACTTGAGAAT 
2701 AGAAGGAATT CTGTGTACAC TCAATCTAAA CATGCAAACA AAGAAAGCAT 
2751 CAATGTGGCT CAAGAAGAAA CTAAAGTCGT TTTTAATGAA TAACATTACC 
2801 CATTATTTTA GAAAGACGAT TACAACCGAA GACTTTGCGA ATAAAACTCT 
285 1 CAACAAGTTA TTTATATCAG GCGGTTACAA ATACATGCAA TGAGCCAAAG 
2901 AATACAAGGA CCACTTTAAG AAGAACTTAG CTATGAGCAG TATGATCGAC 
2951 TTAGAGGTAT CTAAAATTAT ATACTCTGTA ACCAGAGCAT TCTTTAAATA 
3001 CCTTGTGTGC AATATTAAGG ATACAATTTT TGGAGAGGAG CATTATCCAG 
305 1 ACTTTTTCCT TAGCACACTG AAGCACTTTA TTGAAATATT CAGCACAAAA 
3101 AAGTAC ATTT TCAAC AG AGT TTGCATGATC CTCAAGGCAA AAGAAGCAAA 
3151 GCTAAAAAGT GACCAATGTC AATCTCTAAT TCAATATGAT GCATAGTCGA 
3201 CTATTCTAAC TTATTTTGGA AAGTTAATTT TCAATTTTTG TCTTATATAC 
3251 TGGGGTTTTG GGGTTTTGGG GTTTTGGGG 




FIGURE 10 

1 MEVDVDNQAD NHGIHSALKT CEEIKEAKTL YSWIQKVIRC RNQSQSHYKD 
51 LEDDGFAQT NIVATPRDYN EEDFKVIARK EVFSTGLMIE LIDKCLVELL 
101 SSSDVSDRQK LQCFGFQLKG NQLAKTHLLT ALSTQKQYFF QDEWNQVRAM 
1 5 1 IGNELFRHLY TKYLIFQRTS EGTLVQFCGN NVFDHLKVND KFDKKQKGGA 
201 ADMNEPRCCS TCKYNVKNEK DHFLNNINVP NWNNMKSRTR IFYCTHFNRN 
251 NQFFKKHEFV SNKNNISAMD RAQTIFTNIF RFNRIRKKLK DKVIEKIAYM 
301 LEKVKDFNFN YYLTKSCPLP ENWRERKQKI ENLINKTREE KSKYYEELFS 
351 YTTDNKCVTQ FINEFFYNIL PKDFLTGRNR KNFQKKVKKY VELNKHELIH 
401 KNLLLEKINT REISWMQVET SAKHFYYFDH ENIYVLWKLL RWIFEDLWS 
451 LER.CFFYVTE QQKSYSKTYY YRKN1WDVIM KMSIADLKKE TLAEVQEKEV 
501 EEWKKSLGFA PGKLRLIPKK TTFRPIMTFN KKIVNSDRKT TKLTTNTKLL 
551 NSHLMLKTLK NRMFKDPFGF AVFNYDDVMK KYEEFVCKWK QVGQPKLFFA 
601 TMDIEKCYDS VNREKLSTFL KTTKLLSSDF W1MTAQILKR KNNIVIDSKN 
651 FRKKEMKDYF RQKFQKIALE GGQYPTLFSV LENEQNDLNA KKTLIVEAKQ 
701 RNYFKKDNLL QPVINICQYN YINFNGKFYK QTKGIPQGLC VSSILSSFYY 
751 ATLEESSLGF LRDESMNPEN PNVNLLMRLT DDYLLITTQE NNAVLFIEKL 
801 INVSRENGFK FNMKKLQTSF PLSPSKFAKY GMDSVEEQNI VQDYCDWIGI 
85 1 SIDMKTLALM PNINLRIEGI LCTLNLNMQT KKASMWLKKK LKSFLMNNIT 
901 HYFRKTITTE DFANKTLNKL FISGGYKYMQ CAKEYKDHFK KNLAMSSMID 
951 LEVSKIIYSV TRAFFKYLVC NIKDTIFGEE HYPDFFLSTL KHF1EIFSTK 
1001 KYIFNRVCMI LKAKEAKLKS DQCQSLIQYD A 
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FIGURE 11 

1 CCCCAAAACC CCAAAACCCC AAAACCCCTA TAAAAAAAGA AAAAATTGAG 
5 1 GTAGTTTAG A AATAAAATAT TATTCCCGCA CAAATGGAGA TGGATATTGA 
1 0 1 TTTGG ATG AT ATAGAAAATT TACTTCCTAA TACATTCAAC AAGTATAGCA 
151 GCTCTTGTAG TGACAAGAAA GGATGCAAAA CATTGAAATC TGGCTCGAAA 
201 TCGCCTTC AT TGACTATTCC AAAGTTGCAA AAACAATTAG AGTTCTACTT 
251 CTCGGATGCA AATCTTTATA ACGATTCTTT CTTGAGAAAA TTAGTTTTAA 
301 AAAGCGGAGA GCAAAGAGTA GAAATTGAAA CATTACTAAT GTTTAAATAA 
35 1 AATCAGGTAA TGAGGATTAT TCTATTTTTT AGATCACTTC TTAAGGAGCA 
401 TTATGGAGAA AATTACTTAA TACTAAAAGG TAAACAGTTT GGATTATTTC 
451 CCTAGCCAAC AATGATGAGT ATATTAAATT CATATGAGAA TGAGTCAAAG 
50 1 G ATCTCG ATA CATCAGACTT ACCAAAGACA AACTCGCTAT AAAACGCAAG 
55 1 AAAAAGTTTG ATAATCGAAC AGCAGAAGAA CTTATTGCAT TTACTATTCG 
601 TATGGGTTTT ATTACAATTG TTTTAGGTAT CGACGGTGAA CTCCCGAGTC 
651 TTGAGACAAT TGAAAAAGCT GTTTACAACT GAAGGAATCG CAGTTCTGAA 
701 AGTTCTGATG TGTATGCCAT TATTTTGTGA ATTAATCTCA AATATCTTAT 
751 CTCAATTTAA TGGATAGCTA TAGAAACAAA CCAAATAAAC CATGCAAGTT 
801 TAATGGAATA TACGTTAAAT CCTTTGGGAC AAATGCACAC TGAATTTATA 
851 TTGGATTCTT AAAGCATAGA TACACAGAAT GCTTTAGAGA CTGATTTAGC 
901 TTACAACAGA TTACCTGTTT TGATTACTCT TGCTCATCTC TTATATCTTT 
951 AAAAGAAGCA GGCGAAATGA AAAGAAGACT AAAGAAAGAG ATTTCAAAAT 
1 00 1 TTGTTG ATTC TTCTGTAACC GGAATTAACA ACAAGAATAT TAGCAACGAA 
1 05 1 AAAGAAGAAG AGCTATCACA ATCCTGATTC TTAAAGATTT CAAAAATTCC 
1 101 AGGTAAGAGA GAT AC ATTC A TTAAAATTCA TATATTATAG TTTTTCATTT 
1151 CACAGCTGTT ATTTTCTTTT ATCTTAACAA TATTTTTTGA TTAGCTGGAA 
1201 GTAAAAAGTA TCAAATAAGA GAAGCGCTAG ACTGAGGTAA CTTAGCTTAT 
1 25 1 TCACATTC AT AGATCGACCT TCATATATCC AATACGATGA TAAGGAAACA 
1301 GCAGTC ATCC GTTTTAAAAA TAGTGCTATG AGGACTAAAT TTTTAGAGTC 
1351 AAG AAATGG A GCCG AAATCT TAATCAAAAA GAATTGCGTC G ATATTGCAA 
1401 AAGAATCGAA CTCT AAATCT TTCGTTAATA AGTATTACCA ATCTTGATTG 
1451 ATTGAAGAGA TTGACGAGGC AACTGCACAG AAGATCATTA AAGAAATAAA 
1501 GTAACTTTTA TTAATTAGAG AATAAACTAA ATTACTAATA TAGAGATCAG 
1551 CGATCTTCAA TTGACGAAAT AAAAGCTGAA CTAAAGTTAG ACAATAAAAA 
1601 ATACAAACCT TGGTCAAAAT ATTGAGGAAG GAAAAGAAGA CCAGTTAGCA 
165 1 AAAGAAAAAA TAAGGCAATA AATAAAATGA GTACAGAAGT GAAGAAATAA 
1701 AAGATTTATT TTTTTCAATA ATTTATTGAA AAGAGGGGTT TTGGGGTTTT 
1751 GGGGTTTTGG GG 
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CCCCAAAACCCCAAAACCCCAAAACCCCTATAAAAAAAGAAAAAATTGAGGTAGTTTAGA 

1 * ♦ * * * 60 

GGGGTTTTGGGGTTTTGGGGTTTTGGGG AT ATTTTTTTCTTTTTTAACTCC ATC AAATCT 

PQNPKTPKPL-KKKKLR'FR 
PKTPKPQNPYKKRKNCGSLE- 
PKPQNPKTPIKKEKIEVV'K- 
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61 * 120 
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MKILFPHKWRWILIWMI.*KI 
IKYYSRTNGDGYCFGC^Y.RKF- 
* N I IPAOMEMDIDLDD ICNL- 

TACTTCCTAATACATTCAACAAGTATAGCAGCTCTTGTAGTGACAAGAAAGGATGCAAAA 

1 21 * "* * * * 180 

ATGAAGGATTATGTAAGTTGTTCATATCGTCGAGAACATCACTGTTCTTTCCTACG^ 

YFLIHSTSIAALVVTRKDAK 
TS -YIQOV-QLL" 0 E R M Q N - 

LPNTrNKTSSSCS D K KOC KT- 

CATTGAAATCTGGCTCGAAATCGCCTTCATTGACTATTCCAAAGTTGCAAAAACA^ 

181 * * 240 

GTAACTTTAGACCGAGCTTTAGCGGAAGTAACTGATAAGGTTTC 
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I E I W L E I A F I DYSKVAKTIR- 
LKSGSKSPSLTIPK-LGKQLK- 



AGTTCTACTTCTCCGATGCAAATCTTTATAACGATTCTTTCTTGAGAAAATTAGTTTTAA 
TCAAGATGA-AGAGCCTACGTTTAGAAATATTGCTAAGAAAGAACTCTTTTAATCAAAATT 
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CATATGAGAATGAGTCAAAGGATCTCGATACATCAGACTTACCAAAGACAAACTCGCTAT 

481 -- - * * 540 

GTATACTCTTACTCAGTTTCCTAGAGCTATGTAGTCTGAATGGTTTCTGTTTGAGCCATA 

HMRMSQRISIHQTYQRQTRY 
b ICECVKGSRY IRLTKOKLAI 

c YEtfESKDLDTSDLPKTNSL * - 

AAAACGCAAGAAAAAGTTTGATAATCGAACAGCAGAAGAACTTATTGCATTTACTATTCG 

541 * * * ♦ * 600 

TTTTGCGTTCTTTTTCAAACTATTAGCTTGTCGTCTTCTTGAATAACGTAAATGATAAGC 

a KTQEKVC * SNSRRTYC I YYS 

b KRKKKFDNRTAEELIAFTIR- 

C NARKSLI1EQQKNLLHLLFV- 

TATGGGTTTTATTACAATTCTTTTAGGTATCGACGGTGAACTCCCGAGTCTTGAGACAAT 

501 * * * * 660 

AT ACCC AAAAT AATGTT AAC AAAATCC AT AGCTGCC ACTTG ACX5GCTC AGAACTCTGTTA 

a YGFYYNCFRYRRCTPESCDN - 

b MGFITIVLGIDGELPSLETI- 
C WVLLQLF'VSTVNSRVLRQL- 

TCAAAAAGCTGTTTACAACTGAAGGAATCCCAGTTCTGAAAGTTCTGATGTGTATGCCAT 

661 * 720 

ACTTTTTCGACAAATGTTGACTTCCTTAGCGTCAAGACTTTCAAGACTACACATACGGTA 

a CKSCLQLKESQFCKF'CVCH 

b EKAVYNCRNRSSESSOVYAI- 

c KKLFTTEG IAVLKVLMCMPU- 

TATTTTGTGAATTAATCTCAAATATCTTATCTCAATTTAATGGATAGCTATAGAAACAAA 

721 * ♦ 780 

AT AAAA C ACTT AATTA G AGTTT AT AG AAT AG AGTT AAATTACCT ATCGAT ATCTTTGTTT 

a YFVN * S Q I SYLNLMDSYRNK 

b ILCfNLKYLISI'WIAIETN- 

c FCELISNI LSQFNG" L " K Q T - 

CCAAATA.^ACCATCCAAGTTTAATGCAATATACGTTAAATCCTTTGGGACAAATGCACAC 

781 * * 840 

GGTTT ATTTGCT AC CTTC AAATT AC CTT ATATGC AATTT AGG AAACCCTGTTTACGTGTG 

a PNKPCKFNCIYVKSFGTNAH 

b QINMASlMEYTLNPLGQMHT- 

c k * t y 0 v " w n i r ' i l w d k c t l - 

tgaatttatattggattcttaaagcatacatacacagaatgctttagagactgatttacc 

841 -- 900 

acTTXAatat -.acctaac -atttcctatctatgtgtcttaccaaatctctcactaaatcc 

d C I V : C f - K HRYTECFRDCFS 

b EFI-OS'SIDTQNALETD LA- 

C N L V /. I L < A " I HRML * RLI * L - 

ttacaacagattacctgttttgattactcttgctcatctcttatatctttaaaagaagca 

901 960 

aatgttgtct^tccacaaaactaatgagaacgagtagagaatatagaaattttcttcgt 

a LQQ; r CfDYSCSSLISLXEA 

b YNRLPVLITLAHLLYL'KKQ- 
c TTDYLrCLLLLISY I FKRSR- 

gccgaaatgaaaagaacactaaagaaagagatttcaaaatttgttgattcttctgtaacc 

961 - ♦ 1020 

cccctttacttttcttctcatttctttctctaaagttttaaacaactaagaagacattgg 

a cemk3rlx k £ i sk fvossvt 

b axcked'rkrfqnllill'p- 
c rnehxtkerdfkiccffcnr- 

gcaattaaca.acaagaatattagcaacgaaaaagaagaagacctatcacaatcctgattc 

1021 1080 

CCTTAATTGT^CTTCTTATA-^TCCTTGCTTTTTCTTCTTCTCGATAGTGTTAGGACTAAG 

a cinn.nnisnekeeelsoscf 

b ELT7RI_ATKKKKSYHNPDS- 
c N*-OOEY-0?KR3RAITILrL- 
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TT AAAGATTTC AAAAATTCC AGGT AAG AG AG AT AC ATTC ATT AAAATTC AT AT ATT ATAG 

1081 U40 

AATTTCTAAAGTTTTTAAGGTCCATTCTC^ 

a LKISKIPGKRDTFI K I H I L 

b • R F Q K F Q V R E I H S L K F I Y Y S - 

C KDFKNSR * E R Y I H * N S Y I I V - 

TTTTTCATTTCACAGCTGTTATTTTCTTTTATCT^ 

1141 * * + ♦ 1200 

AAAAAGTAAAGTGTCGACAATAAAAGAAAATAGAATTGTTATAAAAAACTAATCGACCTT 

a FFISQLLFSFILTI F F D * L E 

b FSFHSCYFL.LS 'QYFLISWK- 

c FHFTAVI FFY L N N I FCLAGS- 

GTAAAAAGTATCAAATAAGAGAAGCGCTAGACTGAGGTAACTTAGCTTATTCACATTCAT 

1201 * * 1260 

CATTTTTCATAGTTTATTCTCTTCGCGATCTGACTCCATT^ 

a VKSIK*EKR * T £ V T * LIHIH 

b * KVSNKRSARLR * L SLFTF I 

c KKYQIREALDCGNLAYSHS • - 

AGATCGACCTTCATATATCCAATACGATGATAAGGAAACAGCAGTCATCCGTTTTAAAAA 

1261 * * * * * 1320 

TCTAGCTGGAAGTATATAGGTTATGCTACTATTCCTTTGTCGTCAGTAGX3CAAAATTTTT 

a RSTFIYPIRC-GNSSHPF-K 

b ORPSYIQYDDKETAVIRFKN- 

c IDLHISNTMIRKQQSSVLKI- 

TAGTGCTATGAGGACTAAArTTrrAGAGTCAAGAAATCGAGC 

1321 * - 1380 

ATCACGATACTCCTGATTTAAAAATCTCAGTTCTTTACCTCGGCTCT 

a * C Y E D * IFRVKKWSRNLNQK 

b SAMRTKFLESRNGAE I L IKK 

c VLCGLNF * SQEHEPKS * SKR- 

GAATTGCGTCGATATTGCAAAAGAATCGAACTCTAAATCTTTCGTTAATAAGTATTACCA 

1381 - + 1140 

CTTAACGCAGCTATAACGTTTTCTTAGCTTGAGATTTAGAAAGCAATTATTCATAATGGT 

a EL.RHYCKR I £ L * IFR * *VLP 

b NCVUIAKESNSKSFVNKYYQ- 

c ias:lqknrtlnlslisitn- 

ATCTTGATTGATTGAAGAGATTGACCAGGCAACTGCACAGAAGATCATTAAAGAAATAAA 

1441 1500 

TAGAACTAAC7A.ACTTCTCTAACTGCTCCGTTGACGTGTCTTCTAGTAATTTCTTTATTT 

a I LIDCRD" RGNCT EDH * RNK 

b SCLIEEIDEATAQKI I X E I K - 

C LDCLKRLTRQLHR RSLKK * 3 - 

GTAACTTTTATTAATTAGAGAATAAACTAAATTACTAATATACAGATCAGCGATCTTCAA 

1501 * - 1560 

CATTGAAAAT^TTAATCTCTTATTTGATTTAATGATTATATCTCTAGTCGCTAGAACTT 

a VTFIM'RIN-ITNtEISDLQ 

b LLLIRE'TKLLC * R S A I F N - 

C NFY'LENKLNY * Y RDQRSS I- 

TTGACGAAATaAAAGCTGAACTAAAGTTAGACAATAAAAAATACAAACCTTGCTCAAAAT 
1561 - 1620 

AAC TGCTTTATTTTCG a cttg atttc aatctg tta tttttt atgtttgg aacc agtttt A 

a ltk*:<ln*s-tikntnlcqn 
b crnksctkvrq'k i q t l v k [ 

c deikaelkldnkk y k pwsky- 

attgaggaagcaaaagaagaccagttagcaaaagaaaaaataacgcaataaataaaatga 
1621 - - 1680 

taactccttccttttcttctggtcaatcgttttcttttttattcccttatttattttact 
a ieegkedolakekcso*!kc 

b LRKEKKTS'QKKK'GNK'NE- 
C CGRXRRPVSKRKNXAfNKMS- 
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GTACAGAACTGAAGAAATAAAAGATTTATTTTTTTCAATAATTTATTGAAAAGAGGGGTT 



CATCTCTTCACTTCTTTATTTTCTAAATAAAAAAAGTTATTA^ 

a vQKCRNKRFIFFNNLLKRGV 

b YRSEEIKDLFFSriYCKEGF- 

c tevkk*kiyFfq*fiekrcf- 

TTGGGG TTTTGGGGTTTTGGGG 

1741 -- 1762 

AACCCCAAAACCCCAAAACCCC 

a LGrWGFG 

b WCFGVLG- 

c G V I G F W 
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FIGURE 19 



1 aactcattta attactaatt taatcaacaa gattgataaa aagcagtaaa taaaacccaa 
61 tagatttaat ttagaaagta tcaattgaaa aatggaaatt gaaaacaact aagcacaata 
121 gccaaaagcc gaaaaattgt ggtgggaact tgaattagag atgcaagaaa accaaaatga 
181 tatataagtt agggttaaga ttgacgatcc taagcaatat ctcgtgaacg tcactgcagc 
24 1 atgtttgttg taggaaggta gttactacta agataaagat gaaagaagat atatcatcac 
301 taaagcactt cttgaggtgg ctgagtctga tcctgagttc atctgctagt tggcagtcta 
361 catccgtaat gaactttaca tcagaactac cactaactac attgtagcat tttgtgttgt 
42 1 ccacaagaat actcaaccat tcatcgaaaa gtacttcaac aaagcagtac ttttgcctaa 
481 tgacttactg gaagtctgtg aatttgcata ggttctctat atttttgatg caactgaatt 
54 1 caaaaatttg tatcttgata ggatactttc ataagatatt cgtaaggaac tcactttccg 
601 taagtgttta caaagatgcg tcagaagcaa gttttctgaa ttcaacgaat actaacttgg 
661 taagtattgc actgaatcct aacgtaagaa aacaatgttc cgttacctct cagttaccaa . 
721 caagtaaaag tgggattaaa ctaagaagaa gagaaaagag aatctcttaa ccaaacttta 
78 1 ggcaataaag gaatctgaag ataagtccaa gagagaaact ggagacataa tgaacgttga 
84 1 agatgcaatc aaggctttaa aaccagcagt tatgaagaaa atagccaaga gatagaatgc 
901 catgaagaaa cacatgaagg cacctaaaat tcctaactct accttggaat caaagtactt 
961 gaccttcaag gatctcatta agttctgcca tatttctgag cctaaagaaa gagtctataa 
1021 gatccttggt aaaaaatacc ctaagaccga agaggaatac aaagcagcct ttggtgattc 
1081 tgcatctgca cccttcaatc ctgaattggc tggaaagcgt atgaagattg aaatctctaa 
1 141 aacatgggaa aatgaactca gtgcaaaagg caacactgct gaggtttggg ataatttaat 
1201 ttcaagcaat taactcccat atatggccat gttacgtaac ttgtctaaca tcttaaaagc 
1261 cggtgtttca gatactacac actctattgt gatcaacaag atttgtgagc ccaaggccgt 
1321 tgagaactcc aagatgttcc ctcttcaatt ctttagtgcc attgaagctg ttaatgaagc 
1381 agttactaag ggattcaagg ccaagaagag agaaaatatg aatcttaaag gtcaaatcga 
1441 agcaglaaag gaagttgttg aaaaaaccga tgaagagaag aaagatatgg agttggagta 
1501 aaccgaagaa ggagaatttg ttaaagtcaa cgaaggaatt ggcaagcaat acattaactc 
1561 cattgaactt gcaatcaaga tagcagttaa caagaattta gatgaaatcaaaggacacac 
1 62 1 tgcaatcttc tctgatgttt ctggttctat gagtacctca atgtcaggtg gagccaagaa 
1 68 1 gtatggttcc gttcgtactt gtctcgagtg tgcattagtc cttggtttga tggtaaaata 
1741 acgttgtgaa aagtcctcat tctacatctt cagttcacct agttctcaat gcaataagtg 
1801 ttacttagaa gttgatctcc ctggagacga actccgtcct tctatgtaaa aacttttgca 
1861 agagaaagga aaacttggtg gtggtactga tttcccctat gagtgcattg atgaatggac 
1921 aaagaataaa actcacgtag acaatatcgt tattttgtct gatatgatga ttgcagaagg 
1981 atattcagat atcaatgtta gaggcagttc cattgttaac agcatcaaaa agtacaagga 
204 1 tgaagtaaat cctaacatta aaatctttgc agttgactta gaaggttacg gaaagtgcct 
2101 taatctaggt gatgagttca atgaaaacaa ctacatcaag atattcggta tgagcgattc 
2161 aatcttaaag ttcatttcag ccaagcaagg aggagcaaat atggtcgaag ttatcaaaaa 
2221 ctttgccctt caaaaaatag gacaaaagtg agtttcttga gattcttcta taacaaaaat 
2281 ctcaccccac ttttttgttt tattgcatag ccattatgaa atttaaatta ttatctattt 
2341 atttaagtta cttacatagt ttatgtatcg cagtctatta gcctattcaa atgattctgc 
2401 aaagaacaaa aaagattaaa a 




FIGURE 20 

MEIENNQAQQPKAEKLWWELELEMQENQNDIQVRVKIDDPKQYL 

VNVTAACLLQEGSYYQDKDERKYIITKALLEVAESDPEFICQLAVYIRI^ELYIRTTTN 

YIVAFCWHKNTQPFIEKYFNKAVLLPNDLLEVCEFAQVLYIFDATEFKMLYLDR1LS 

QDIRKELTFRKCLQRCVRSKJSEFNEYQLG 
KJKRKENLLTTCLQAIKESEDKSKJ^TGM 

APKJPNSTLESK YLTFKDH KFC HI S EPKER VYKILGKKYPKTEEEYKAAFGDS ASAP 

FNPEL AGKRMK1EI S KT WENELS AKGNTAEV WDNLI S SNQLPYMAMLRNLSNILKAG V 

SDTTHSIVmKJCEPK^VENSKJvlFPLQFFSAIEAVNEAVTKGFKAKKRENMNL 

AVK^VVEKTDEEKJO)MELEQTEEGEFVKVNEGIGKQYINSIELAIKIAVNKNLDEIKG 

HTAIFSDVSGSMSTSMSGGAKKYGSVRTCLECALVLGLMVKQRCEKSSFY1FSSPSSQ 

CNKCYLEVDLPGDELRPSMQKLLQEKGKLGGGTDFPYECIDEWTKNKTHVDNIVILSD 

MMIAEGYSDRWRGSSIVNSIKKYKJDEVNPNIKIFAVDLEGYGK^^ 

K1FGMSDSILKFISAKQGGANMVEVIKNFALQKJGQK 




FIGURE 21 

1 tcaatactat taattaataa ataaaaaaaa gcaaactaca aagaaaatgt caaggcgtaa 
61 ctaaaaaaag ccataggctc ctataggcaa tgaaacaaat cttgattttg tattacaaaa 
121 tctagaagtt tacaaaagcc agattgagca ttataagacc tagtagtaat agatcaaaga 
181 ggaggatctc aagcttttaa agttcaaaaa ttaagattag gatggaaact ctggcaacga 
241 tgatgatgat gaagaaaaca actcaaataa ataataagaa ttattaagga gagtcaatta 
301 gattaagtag caagtttaat tgataaaaaa agttggttct aaggtagaga aagatttgaa 
361 tttgaacgaa gatgaaaaca aaaagaatgg actttctgaa tagcaagtga aagaagagta 
421 attaagaacg attactgaag aataggttaa gtattaaaat ttagtattta acatggacta 
481 ccagttagat ttaaatgaga gtggtggcca tagaagacac agaagagaaa cagattatga 
541 tactgaaaaa tggtttgaaa tatctcatga ccaaaaaaat tatgtatcaa tttacgccaa 
601 ctaaaagaca tcatattgtt ggtggcttaa agattatttt aataaaaaca attatgatca 
661 tcttaatgta agcattaaca gactagaaac tgaagccgaa ttctatgcct ttgatgattt 
72 1 ttcacaaaca atcaaactta ctaataattc ttactagact gttaacatag acgttaattt 
781 tgataataat ctctgtatac tcgcattgct tagattttta ttatcactag aaagattcaa 
841 tattttgaat ataagatctt cttatacaag aaattaatat aattttgaga aaattggtga 
901 gctacttgaa actatcttcg cagttgtctt ttctcatcgc cacttacaag gcattcattt 
961 acaagttcct tgcgaagcgt tctaatattt agttaactcc tcatcataaa ttagcgttaa 
1 021 agatagctaa ttataggtat actctttctc tacagactta aaattagttg acactaacaa 
1081 agtccaagat tattttaagt tcttataaga attccctcgt ttgactcatg taagctagta 
1141 ggctatccca gttagtgcta ctaacgctgt agagaacctc aatgttttac ttaaaaaggt 
1201 caagcatgct aatcttaatt tagtttctat ccctacctaa ttcaattttg atttctactt 
1261 tgttaattta taacatttga aattagagtt tggattagaa ccaaatattt tgacaaaaca 
1321 aaagcttgaa aatctacttt tgagtataaa ataatcaaaa aatcttaaat ttttaagatt 
1381 aaacttttac acctacgttg cttaagaaac ctccagaaaa cagatattaa aacaagctac 
1441 aacaatcaaa aatctcaaaa acaataaaaa tcaagaagaa actcctgaaa ctaaagatga 
1501 aactccaagc gaaagcacaa gtggtatgaa attttttgat catctttctg aattaaccga 
1561 gcttgaagat ttcagcgtta acttgtaagc tacccaagaa atttatgata gcttgcacaa 
1621 acttttgatt agatcaacaa atttaaagaa gttcaaatta agttacaaat atgaaatgga 
1681 aaagagtaaa atggatacat tcatagatct taagaatatt tatgaaacct taaacaatct 
1741 taaaagatgc tctgttaata tatcaaatcc tcatggaaac atttcttatg aactgacaaa 
1801 taaagattct actttttata aatnaagct gaccttaaac taagaattat aacacgctaa 
1 86 1 gtatactm aagtagaacg aattttaatt taataacgtt aaaagtgcaa aaattgaatc 
1921 ttcctcatta gaaagcrtag aagatattga tagtctttgc aaatctattg cttcttgtaa 
1981 aaamacaa aatgttaata ttatcgccag tttgctctat cccaacaata tttagaaaaa 
204 1 tcctttcaat aagcccaatc ttctattttt caagcaattt gaataattga aaaatttgga 
2101 aaatgtatct atcaactgta ttcttgatca gcatatactt aattctattt cagaattctt 
2161 agaaaagaat aaaaaaataa aagcattcat tttgaaaaga tattatttat tacaatatta 
2221 tcttgattat actaaattat ttaaaacact tcaatagtta cctgaattaa attaagttta 
2281 cattaattag caattagaag aattgactgt gagtgaagta cataagtaag tatgggaaaa 
2341 ccacaagcaa aaagctttct atgaaccatt atgtgagttt atcaaagaat catcctaaac 
2401 cctttagcta atagattttg accaaaacac tgtaagtgat gactctatta aaaagatttt 
2461 agaatctata tctgagtcta agtatcatca ttatttgaga ttgaacccta gttaatctag 
2521 cagtttaatt aaatctgaaa acgaagaaat ttaagaactt ctcaaagctt gcgacgaaaa 
2581 aggtgtttta gtaaaagcat actataaatt ccctctatgt ttaccaactg gtacttatta 
2641 cgattacaat tcagatagat ggtgattaat taaatattag tttaaataaa tattaaatat 
2701 tgaatatttc tttgcttatt atttgaataa tacatacaat agtcattttt agtgttttga 
2761 atatatttta gttatttaat tcattatttt aagtaaataa ttatttttca atcatttttt 
2821 aaaaaatcg 
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MSRRMQKKPQAP1GNETNLDFVLQNLEVYKSQIEHYKTQQQQIK 

EEDLKLLKFKNQDQDGNSGNDDDDEENNSNKQQELLRRVNQIKQQVQLIKKVGSKVEK 

DLNLNEDENKJWGLSEQQVKJEEQLRTITEEQVKYQNLVFNMDYQLDLNESGGHRRHRR 

ETDYDTEKWEISHDQKNYVSIYANQKTSYCWWLKJDYFNKNWDHLNVSINRLETEAE 

FYAFDDFSOTIKLT^INSYQTVNIDVNFDNNLCILALLRFLLSLERFNIL^fIRSSYTRN 

QYNFEKJGELLETIFAVVFSHRHLQGIHLQVPCEAFQYLVNSSSQISVKDSQLQVYSF 

STDLiaVDTNKVQDYFKFLQEFPRLTHVSQQAIPVSATNAVENLl^LKKVKHANLNL 

VSIPTQFNFDFYFWLQHLKXEFGLEPNILTKQKLENLLLSIKQSKNLKFLRLNFYTY 

VAQETSRKQILKQATT1KNLIGWKNQEETPETKJDETPSESTSGMKFFDHLSELTE 

FSVNLQATQEIYDSLHKLLIRSTNLKKFKLSYKYEMEKSKMDTFIDLKNIYETLNNLK 

RCSVNISNPHGNISYELTNKDSTFYKFKLTLNQELQHAKYTFKQNEFQFNNVKSAKIE 

SSSLESLEDIDSLCKSIASCKNLQNVNIIASLLYPNNIQKNPFNKPNLLFFKQFEQLK 

NLEhA/SINCILDQHILNSISEFLEKNKKIKAFILKRYYLLQYYLDYTKLFKTLQQLPE 

LNQVYn^QQLEELTVSEVHKQVWENHKQKAFYEPLCEFIKESSQTLQLIDFDQNTVSD 

DSIKKJLESISESKYHHYLRLNPSQSSSLIKSENEEIQELLKACDEKGVLVKAYYKFP 

LC LPTGTY YDYN S D R W 
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MKJLFEFIQDKLDIDLQTNSTYKENLKCGHFNGLDEILTTCFAL 

PNSR1GALPCLPGDLSHKAVIDHCI1YLLTGELYNNVLTFGYKIARNEDVNNSLFCHS 

ANVNVTLLKGAAWKMFHSLVGTYAFVDLLrNYTVIQFNGQFFTQIVGNRCNEPHLPPK 

WVQRSSSSSATAAQIKQLTEPVTNKQFLHKLNfNSSSFFPYSKlLPSSSSIKKLTDLR 

EAIFPTNLVKJPQRLKVRJNLTLQKLLKJUiKRLNYVSILNSICPPLEGTVLDLSHLSR 

QSPKERVLKfIIVILQKI,LPQEMFGSKJCNKGKIIKNLNLLLSLPLNGYLPFDSLLKKL 

RLKDFRWLFISDIWFTKHNFENLNQLAICFISWLFRQLIPKIIQTFFYCTEISSTVTI 

VWRHDTWhnaiTPFIVEYFKTYLVENNVCRNHNSYTL^^ 

IIAIPCRGADEEEFTIYKENHKWAIQPTQKILEYLRNKJRPTSFTKJYSPTQ1ADR1KE 

FKQRLLKKiWVLPELYFMKFDVKSCYDSIPRMECMRILKDALKNENGFFVRSQYFFN 

TNTGVLKLFNVVNASRVPKPYELYIDNVRTVHLSNQ 

IREDGLFQGSSLSAPIVDLVYDDLLEFYSEFKASPSQDTLILKLADDFLIISTDQQQV 

INIKJaAMGGFQKYNAKANRDKJLAVSSQSDDDTVIQFCAMfflFVKELEVWKHSSTMN 

NFHIRSKSSKGIFRSLlALFNTRiSYKTIDTNLNSTNTVLMQIDHVVKNISECYKSAF 

KDLSINVTQNMQFHSFLQR11EMTVSGCPITKCDPLIEYEVRFTILNGFLESLSSNTS 

KFKDNHLLRKEIQHLQAYIYIYIHIVN 
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Oxythcha LCVSY1LSSFYYANLEENALQFLRKESMDPEKPETNLLMRLT 
Euplotes LCVSSILSSFYYATLEESSLGFLRDESMNPENPNVNLLMRLT 
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Motif 0 

* human " AXFLHWLMS VY VVELLRS FF Y VTETTFQXHR 

tezl ISEIBWLVLOKRSNAXMCLSDrEKRKQIFAEFIYWLYNSFIIPILQSFFYlTBSSDJLRMR 

EST2 LKDFRWLFISD IWFTKHNFEKLNQLAICFISWLFRQLIPKI IQTFFYCTEISSTVT- 

pl23 TRE I SWMQ V ET-SAXHFYYF DHEN - 1 Y VLVf K LL RW I FEDL WSL I RCF F Y VX EQQK S Y S K 

. * ** 

Motif 1 

human LFFYRKSVWSKLQ6I0IRQKLKRVQLRDVSEAEVRQHREARPALLTSRJLRFIPKP--D0L 
tezl TVYFRKDlWXLLCRPFI-TSMKMEAFEKINENNVRMDTQK-TTLPPAVIRLLPKX--irrF 
PST2 IVYFRMDlVHKLITPFIVEYFKTYI,VEKHVCRNHNSYTLfi — HFNKSKMRI IPKK5NNEF 

Jl23 TYYYRKHIWDVIMKMSI-A0LKKETLABVQEKEVEEWKKS-LGFAPGKLRLIPKK — TTF 

s S • •*.*.** . * - * * 

•H Motif 2 

ijiuma n RP I VNMD YWGARTFRREKRAERLTSRVK ALF -S VLNYERA 

l%*zl RLXTN-LRKRFLI KM0SHKKMLV6TKQTLRPVAS ILKHlilHEESSGl PFHLEV YMKLLTF 

JST2 RIIAIPCRGADEEEFTIYKENHKKAIQPTQKILEYLRNKRPTSFTKIYSPTQIADRIKEF 

^£123 RPIMTFNKKIVNSDRKTTKLTTNTKIXR5HLMLKXLXN-RMFKDPFGFAVFKYDDVMKKY 
* # * * 

P Motif 3 (A) 

'Iteil KKDLLKHRMFOR-KKYFVRIDIKSCYDRIKQDLKFRIVKK-KLKDPEFVIRKYATIHATS 
|JEST2 KQRLLKKFNNVLPELYFMKFDVK6CYDS I PRMECMRI LKD- AJLKNENGFFVRSQYFFtfTN 

|^pl23 EEFVCKWKQVG^PKLKFATMDIEKCYDSVNREKLSTFLKTTKLLSSDFWIMTAQILKRJCK 
. * * *** . . .* * 
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ATTTATACTCATGAAAATCTTATTCGAGTTCATTCAAGACAAGCTTGACATTGATCTACA 
GACCAACAGTACTTACAAAGAAAATTTAAAATGTGGTCACTTCAATGGCCTCGATGAAAT 
TCTAACTACGTGTTTCGCACTACCAAATTCAAGAAAAATAGCATTACCATGCCTTCCTGG 
TGACTTAAGCCACAAAGCAGTCATTGATCACTGCATCATTTACCTGTTGACGGGCGAATT 
ATACAACAACGTACTAACATTTGGCTATAAAATAGCTAGAAATGAAGATGTCAACAATAG 
TCTTTTTTGCCATTCTGCAAATGTTAACGTTACGTTACTGAAAGGCGCTGCTTGGAAAAT 
GTTCCACAGTTTGGTCGGTACATACGCATTCGTTGATTTATTGATCAATTATACAGTAAT 
TCAATTTAATGGGCAGTTTTTCACTCAAATCGTGGGTAACAGATGTAACGAACCTCATCT 
GCCGCCCAAATGGGTCCAACGATCATCCTCATCATCCGCAACTGCTGCGCAAATCAAACA 
ACTTACAGAACCAGTGACAAATAAACAATTCTTACACAAGCTCAATATAAATTCCTCTTC 
TTTTTTTCCTTATAGCAAGATCCTTCCTTCATCATCATCTATCAAAAAGCTAACTGACTT 
GAGAGAAGCTATTTTTCCCACAAATTTGGTTAAAATTCCTCAGAGACTAAAGGTACGAAT 
TAATTTGACGCTGCAAAAGCTATTAAAGAGACATAAGCGTTTGAATTACGTTTCTATTTT 
GAATAGTATTTGCCCACCATTGGAAGGGACCGTATTGGACTTGTCGCATTTGAGTAGGCA 
AT C ACC AAAGG AA CG AGT CTTGAAATTT AT C ATTGTT ATTTT ACAG AAGTTATT A C C C C A 
AGAAATGTTTGGCTCAAAGAAAAATAAAGGAAAAATTATCAAGAATCTAAATCTTTTATT 
AAGTTTACCCTTAAATGGCTATTTACCATTTGATAGTTTGTTGAAAAAGTTAAGATTAAA 
GGATTTTCGGTGGTTGTTCATTTCTGATATTTGGTTCACCAAGCACAATTTTGAAAACTT 
GAATCAATTGGCGATTTGTTTCATTTCCTGGCTATTTAGACAACTAATTCCCAAAATTAT 
ACAGACTTTTTTTTACTGCACCGAAATATCTTCTACAGTGACAATTGTTTACTTTAGACA 
TGATACTTGGAATAAACTTATCACCCCTTTTATCGTAGAATATTTTAAGACGTACTTAGT 
CGAAAACAACGTATGTAGAAACCATAATAGTTACACGTTGTCCAATTTCAATCATAGCAA 
AATGAGGATTATACCAAAAAAAAGTAATAATGAGTTCAGGATTATTGCCATCCCATGCAG 
AGGGGCAGACGAAGAAGAATTCACAATTTATAAGGAGAATCACAAAAATGCTATCCAGCC 
CACTCAAAAAATTTTAGAATACCTAAGAAACAAAAGGCCGACTAGTTTTACTAAAATATA 
TTCTCCAACGCAAATAGCTGACCGTATCAAAGAATTTAAGCAGAGACTTTTAAAGAAATT 
TAATAATGTCTTACCAGAGCTTTATTTCATGAAATTTGATGTCAAATCTTGCTATGATTC 
CATACCAAGGATGGAATGTATGAGGATACTCAAGGATGCGCTAAAAAATGAAAATGGGTT 
TTTCGTTAGATCTCAATATTTCTTCAATACCAATACAGGTGTATTGAAGTTATTTAATGT 
TGTTAACGCTAGCAGAGTACCAAAACCTTATGAGCTATACATAGATAATGTGAGGACGGT 
T C ATTT AT C AAATCAGG ATGTT AT AAACGTTGT AG AG ATGG AAAT ATTT AAAAC AGCTTT 
GTGGGTTGAAGATAAGTGCTACATTAGAGAAGATGGTCTTTTTCAGGGCTCTAGTTTATC 
TGCTCCGATCGTTGATTTGGTGTATGACGATCTTCTGGAGTTTTATAGCGAGTTTAAAGC 
CAGTCCTAGCCAGGACACATTAATTTTAAAACTGGCTGACGATTTCCTTATAATATCAAC 
AGACCAACAGCAAGTGATCAATATCAAAAAGCTTGCCATGGGCGGATTTCAAAAATATAA 
TGCGAAAGCCAATAGAGACAAAATTTTAGCCGTAAGCTCCCAATCAGATGATGATACGGT 
TATTCAATTTTGTGCAATGCACATATTTGTTAAAGAATTGGAAGTTTGGAAACATTCAAG 
CACAATGAATAATTTCCATATCCGTTCGAAATCTAGTAAAGGGATATTTCGAAGTTTAAT 
AGCGCTGTTTAACACTAGAATCTCTTATAAAACAATTGACACAAATTTAAATTCAACAAA 
CACCGTTCTCATGCAAATTGATCATGTTGTAAAGAACATTTCGGAATGTTATAAATCTGC 
TTTTAAGGATCTATCAATTAATGTTACGCAAAATATGCAATTTCATTCGTTCTTACAACG 
CATCATTGAAATGACAGTCAGCGGTTGTCCAATTACGAAATGTGATCCTTTAATCGAGTA 
TGAGGTACGATTCACCATATTGAATGGATTTTTGGAAAGCCTATCTTCAAACACATCAAA 
ATTTAAAGATAATATCATTCTTTTGAGAAAGGAAATTCAACACTTGCAAGC 
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AKFLHWLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIG1RQHLKR 

VQLRDVSEAEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYVVGARTFRREKR 

AERLTSRVKALFSVLNYERA 
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GCCAAGTTCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTC 

TTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCl 1 1 1 1CTACC 

GGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGGAATCAGACAGCACTTGAA 

GAGGGTGCAGCTGCGGGACGTGTCGGAAGCAGAGGTCAGGCAGCATCGGGA 

AGCCAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGACG 

GGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCG 

CAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGC 

GTGCTCAACTACGAGCGGGCGCG 
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MTEHHTPKSRILRf LENQYVYLCTLNDYVQLVLRGSPASSYSN1CERLRSDVQTSFS1FLHSTVVGF 

DSKPDEGVQFSSPKCSQSELIANVVKQMFDESFERIUINLLMKGFSMNHEDFRAMHYNGVQNDLV 

STFPNYLISILESKNWQLLLE11GSDAMHYLLSKGSIFEALPNDNYLQISGIPLFKNNVFEETVSKKRK 

RTIETSITQNKSARKEVSWNSISISRFS1FYRSSYKKFKQDLYFNLHS1CDRNTVHMWLQWIFPRQFG 

LFNAFQVKQLHKVIPLVSQSTVVPKRLLKVYPLIEQTAKRLHRISLSKVYNHYCPYIDTHDDEKILS 

YSLKPNQVFAFLRS1LVRVFPKLIWGNQRIFEIILKDLETFLKLSRYESFSLHYLMSNIKISEIEWLVL 

GKRSNAKMCLSDFEKRKQIFAEFIYWLYNSFIIPILQSFFY^ 

SMKMEAFEKI^IE^^^RMDTQKTTLPPAVIRLLPKKNTFRLITNLRKRFLIK^ 

LRPVASILKHLINEESSGIPFNLEVYMKLLTFKXDLLKHRMFGRKKYFVRIDIKSCYDRIKQDLMFR 

IVKKKLKDPEFVIRKYATIHATSDRATKNFVSEAFSYFDMVPFEKVVQLLSMKTSDTLFVDFVDY 

WTKSSSE1FKML1CEHLSGHIVKIGNSQYLQKVGIPQGSILSSFLCHFYMEDLIDEYLSFTKKKGSVL 

LRVVDDFLFITVNKKDAKKFLNLSLRGFEK^ 

FSVNMRSLDTLLACPKIDEALFNSTSVELTKHMGKSFFYKILRSSLASFAQVFIDITHNSKFNSCCN1 

YRLGYSMCMRAQAYLKRMKDIFIPQRMFITDLLNVIGRXIWKKLAEILGYTSRRFLSSAEVKWLFC 

LGMRDGLKPSFKYHPCFEQLIYQFQSLTDLIKPLRPVLRQVLFLHRJUAD 



FIGURE 30' 

ggtaccgatttactttcctttcttcataagctaattgcttcctcgaacgctcctaaatctctggaaatatttnacaagaactcaataacaataccaagtcaaattccaatatgaagg 
tgttattagtgatcgataatamctatmatcggtcgttaccaagtataaggacaaaaaga^ 

ggncgcttactmaatcgtggtactgtmagctgctacttctagccaaccgcgtgmctaccccgtcattggatatagctcttggagtagctcacagaaatccttacaaatctt 

ctgatgagactatanagancattacagtccgtgcatattcttaacatggagccttacacmagatgagtcacgtcgcatgatggagtatnggtatcatccaacgtUgcctlg 

aaaaggttgataattamgcaaaatcatgtccttagtggtggtaatccgcgaaag^tttttgatgcttgcacacgtctagcatgattgagatattc 

ctccmaacgcggttttattmctamtctattctcatgttgttccaaatatgm 

aataatctaaattagmcgcttataattgatagtagtagaaagattggtgattctactcgt^ 

aaaatcctataattataaaiattaatcaatamgcggtcactamatttaaaacgttatgatc^ 

GACCGAACACCATACCCCCAAAAGCAGGATTCTTCGCTTTCTAGAGAATCAATATGTATACCTATGTA 

CCTTAAATGATTATGTACAACTTfeTTTTGAGAGGGTCGCCGGCAAGCTCGTATAGCAATATATGCGAA 

CGCTTGAGAAGCGATGTACAAACGTCCTTTTCTA'li"]"] 1 CTTCATTCGACTGTAGTCGGCTTCG ACAGT 

AAGCCAGATGAAGGTGTTCAATTTTCTTCTCCAAAATGCTCACAGTCAGAGgtatatatattmgtmgatm 

ggatagctaatatatgggcagCTAATAGCGAATGTTGTAAAACAGATGTTCGATGAAAGTTTTGAGCGTCGAAGGA 

ATCTACTGATGAAAGGG l ViTCCATGgtaaggtattctaattgtgaaatatttacctgcaattactgtttcaaagagangtatttaaccgataaagAA 

TCATGAAGATTTTCGAGCCATGCATGTAAACGGAGTACAAAATGATCTCGTTTCTACTTTTCCTAATTA 

CCTTATATCTATACTTGAGTCAAAAAATTGGCAACTTTTGTTAGAAATgtaaataccggttaagatgUgcgcacmgaaca 

agactgacaagtatagTATCGGCAGTGATGCCATGCATTACTTATTATCCAAAGGAAGTATTTTTGAGGCTCTTC 

CAAATGACAATTACCTTCAGATTTCTGGCATACCACTTTTTAAAAATAATGTGTTTGAGGAAACTGTGT 

CAAAAAAAAGAAAGCGAACCATTGAAACATCCATTACTCAAAATAAAAGCGCCCGCAAAGAAGTTTC 

CTGGAATAGCATTTCAATTAGTAGGTTTAGCATTTTTTACAGGTCATCCTATAAGAAGTTT 

aactaatactgttatccncataactaatmagATCTATATTTTAACTTACACTCTATTTGTGATCGGAACACAGTACACATG 

TGGCTTCAATGGATTTTTCCAAGGCAATTTGGACTTATAAACGCATTTCAAGTGAAGCAATTGCACAA 

AGTGATTCCACTGGTATCACAGAGTACAGTTGTGCCCAAACGTCTCCTAAAGGTATACCCTTTAATTGA 

ACAAACAGCAAAGCGACTCCATCGTATTTCTCTATCAAAAGTTTACAACCATTATTGCCCATATATTGA 

CACCCACGATGATGAAAAAATCCTTAGTTATTCCTTAAAGCCGAACCAGGTGTTTGCGTTTCTTCGATC 

CATTCTTGTTCGAGTGTTTCCTAAATTAATCTGGGGTAACCAAAGGATATTTGAGATAATATTAAAAGg 

tattgtataaaamattaccactaacgattttaccagACCTCGAAACTT^^ 

ATTATTTAATGAGTAACATAAAGgtaatatgccaaattttmaccattaattaacaatcagATTTCAGAAAT^ 

CCTTGGAAAAAGGTCAAATGCGAAAATGTGCTTAAGTGATTTTGAGAAACGCAAGCAAATATTTGCGG 

AATTCATCTACTGGCTATACAATTCGTTTATAATACCTATTTTACAATCTT'nTnTATATCACTGAATC 

AAGTGATTTACGAAATCGAACTGTTTATTTTAGAAAAGATATTTGGAAACTCTTGTGCCGACCCTTTAT 

TACATCAATGAAAATGGAAGCGTTTGAAAAAATAAACGAGgtattttaaagtattttttgcaaaaagctaatatmcagAACAA 

TGTTAGGATGGATACTCAGAAAACTACTTTGCCTCCAGCAGTTATTCGTCTATTACCTAAGAAGAATAC 

C7TTCGTCTCATTACGAA7TTAAGAAAAAGATTCTTAATAAAG 

cagATGGGTTCAAACAAAAAAATGTTAGTCAGTACGAACCAAACTTTACGACCTGTGGCATCGATACTG 
AAACATTTAATCAATGAAGAAAGTAGTGGTATTCCATTTAACTTGGAGGTTTACATGAAGCTTCTTACT 
TTTAAGAAGGATCTTCTTAAGCACCGAATGTTTGGgtaattatataatgcgcgattcctcattattaatmgcagGCGTAAGAAG 
TATTTTGTACGGATAGATATAAAATCCTGTTATGATCGAATAAAGCAAGATTTGATGTTTCGGATTGT^ 
AAAAAGAAACTCAAGGATCCCGAATTTGTAATTCGAAAGTATGCAACCATACATGCAACAAGTGACCG 
AGCTACAAAAAACTTTGTTAGTGAGGCGTTTTCCTATTgtaagma 

GGTGCCTTTTGAAAAAGTCGTGCAGTTACTTTCTATGAAAACATCAGATACTTTGTTTGTTGA 
GGATTATTGGACCAAAAGTTCTTCTGAAATTTTTAAAATGCTCAAGGAACATCTCTCTGGACACATTGT 
TAAGgtataccaangngaattgtaataacactaatgaaactagATAGGAAATTCTCAATACCTTCAAAAAGTTGGTATCCCTC 
AGGGCTCAATTCTGTCATCTTTTTTGTGTCATTTCTATATGGAAGATTTGATTGA 

TACGAAAAAGAAAGGATCAGTGTTGTTACGAGTAGTCGACGATTTCCTCTTTATAACAGTTAATAAAA 

AGGATGCAAAAAAATTTTTGAATTTATCTTTAAGAGgtgagttgctgtcattcctaagttctaaccgnga 

ACACAATTTTTCTACGAGCCTGGAGAAAACAGTAATAAACTTTGAAAATAGTAATGGGATAATAAACA 

ATACTTTTTTTAATGAAAGCAAGAAAAGAATGCCATTCTTCGGTTTCTCTGTGAACATGAGGTCTCTTG 

ATACATTGTTAGCATGTCCTAAAATTGATGAAGCCTTATTTAACTCTACATCTGTAGAGCTGACGAAAC 

ATATGGGGAAATCTTTTTTTTACAAAATTCT 

ATCCTTTGCACAAGTATTTATTGACATTACCCACAATTCAAAATTCAATTCTTGCTGCAATA TATA TAG 
GCTAGGATACTCTATGTGTATGAGAGCACAAGCATACTTAAAAAGGATGAAGGATATATTTATTCCCC 
AAAGAATGTTCATAACGGgtgagtacttattnaactagaaaagtcattaartaaccttagATCTTTTGAATGTTATTGGAAGAAAA 
ATTTGGAAAAAGTTGGCCGAAATATTAGGATATACGAGTAGGCGTTTCTTGTCCTCTGCAGAAGTCAA 
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ATGgtacgtgtcggtctcgagacucagcaatangacacatcagGCTTTTTTGTCTTGGAATGAGAGATGGTTTGAAACCCTCTT 

TCAAATATCATCCATGCTTCGAACAGCTAATATACCAATTTCAGTCATTGACTGATCTTATCAAGCCGC 

TAAGACCAGTTTTGCGACAGGTGTTATTTTTACATAGAAGAATAGCTGATTAAtgtcatm 

tattactggtgtcttaaacaatattattactaagtatagctgacccccaaagcaagcatactatagga 

atccttatactmaagaaagattgacagtggttgctgactactgcccacatgcccattaaacgggag^ 

aataaggaaagtggttttctataatgaataatgcccgcactaatgcaaaaagacgaagattatcttctaaacaagggggattaagcatatccgaaggaaaagagagtaatat 
acccagtgttgttgaagaaagcaaggataamggaacaagcttctgcagatgacaggctaaattttggtgaccgaattttggtaaaagccccaggttatccatggtggccg 
gccttgctactgagacgaaaagaaactaaggatagmgaatactaatagctcatttaat 

gtgnaagccattanggattccgaaatagccaaatttcrtggttcctcaaagcggaagtctaaagaacttattgaagcttatgaggcttcaaaaactcctcctgatttaaaggag 
gaatcttccaccgatgaggaaatggatagcttatcagctgctgaggagaagcctaattttttgcaaaaaagaaaatatcattgggagacaictcngatgaatcagatgcgga 
gagtatctccagcggatccttgatgtcaataacttctatttctgaaatgtatggtcctactgtcgcttcgacttctcgtagctctacgcagnaagtgaccaaaggtacc 
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EST2 pep 

Euplotes pep 

Trans of tetrahymen 

Consensus 



FFYCTEISST VTXVYFRHDT WN KLIT V FIVE YFK-TYLVEN 

FFYVTEQQKS Y^KTYYTOKN IWTJVT-MKMS IAD LKK ETLA — EVQE 

KKKE GSQIFYYRKP IWKLVSKLTI VKVRIQrSEK NKQMKMNFYQ 



FFY.TE . . K. .S . . YYYRK. IW...-KL F. .K 



40 
43 

50 



EST2 pep 

Euplotes pep 

Trans of tetrahymen 

Consensus 



.OKfc 



NVCRNHNSY TLSNFNH^I 

KEVEEWKKSL GFAPC^lit^i^ 

KIQLEEENLE KVEEKLIPED S? 

K . . . E . . . . F 



Rh|SS3feNNE 



tarn 



FRIIMPCRG 79 

i?RF*tfTFNKK 78 

EF f I yTTFLRK 92 

EskuKTF . RK 100 



EST2 pep 

Euplotes pep 

Trans of tetrahymen 

Consensus 



ADEEBFTIYK ENHKNAIQE^VQKILEVgRNK 
rVNSDRKTTK LTT^^^CL^I^^HLMIJCI I 
DKQKNIK LNU^IIl©^LVFro^KlS- 

. . . . [ .K. .K LN.N. .L . ."S^QL.L. .IeJkS- 



RPTSFTKIYS PTQIADR-IKE 129 

- RKFK -CPFGFAVFtt 120 

ML-G -QXIGYSVFD 130 

.IG. -VF'. 150 



EST2 pep 

Euplotes pep 

Trans of tetrahymen 

Consensus 



FKQRLL^KrN 



YD 
NK-i 



DVMFK YE 

ft 



NVL 

EFVCKWKQVG C 
■A QFIEKWKNKG 



.K-...HKF. .F.,KWK..G 




YFMKFD VKSCYD 
FATMD IEKCYD 
L 



T.D 



-CYD 



157 
155 
158 

186 
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S- 1 : FFY VTE TTF QKN RLF FYR KS V WSK 
S-2. ROHLKRVQLRDVSEAEVRQHREA 
S-3: ART FRR EKR AER LTS RVK ALF SVL NYE 

A-l: AiCF LHW LMS VYV VEL LRS FFY VTE TTF Q . 
A-2: LFF YRK SVW SKL QSI GIR QHL KRV QLR DVS 
A-3: PAL LTS RLR FIP KPD GLR PIV NMD YW 
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Vector 



Genomic DNA Insert 



A5 



B2 



S825 bp Sequenced 
-2 kb Hind HI Fragment 



Vector 



1 kb 



fezP 



RT Motifs 1 2 3(A) 4(B') 5(C) 6(D) 

1/ \ \ \^*~ 



Introns 



1 <:DNA 
3 k 5-20 



CDNA 



7 8 9 10 11 



Hind HI Xca I 



12 13 14 15 



Original PGR 
3' RT-PCR 




Xca I 



Hind lii 



RT-PCR w/ M2-814 
RT-PCR w/ M2-B15 Bdnc 
RT-PCR w/ M2-B15Bcnc 
RT-PCR w/ M2-B16BoYk 



500 bp 
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Poly 4 

t t c 

t a a g c c teg 
5'- cag acc aaa gga att cca taa gg -3' 
QTKGIPQG 



4<B') 



5(C) 

D D Y L L I T 

3'- ctg ctg atg gag gag tag tgg -5' 
a a aaaaa a a 
t t t t 
c c 
Poly 1 
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PCR Product M2 showed Reasonable Match 
with Other Telomerase Proteins 



Ot LCVSYILSSFYYANLEENALQFLRKESMDPEKPETNLLMRLT 

Ea_pl23 KGIPQGIiCVSSILSSFYYATIiEESSLGFLRDESMNPENPNVNLLMRLTDDYLLIT 

Sp__M2 SILSSFLCHFYMEDLIDEYLSFTKKK GSVLLRW 

Sc_J?103 DGLFQGSSLSAPIVDLVYDDLLEFYSEFKASPS QDTLILKLADDFLIIS 



* * * 
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TCA 


ATT 


CTG 


TCA 


TCT 
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TTG 


TGT CAT 


TTC 


TAT 


ATG 


^tc tgg 


ttt 


cct 


taa 


ggt 


agt 
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AGT 
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GAC 


AGT 
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AAA 
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Poly 1 
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3' RT PCR Strategy 



mRNA 



I AAAAAAAAAAAAAAAA . 
TTTTTTTTTTTTTTT ■ 



3' 



tQo 



1 . Synthesis of cONA with Qj Primer. 
mRNA 



5' 
3* 



I AAAAAAAAAAAAAAAA. 
irrTTTTTTTTTTTTT ■ 



15* 



2. First Round PCR Using Outside Primer and Qq Primer. 



3. Second Round PCR Using Inside Primer and Q\ Primer. 



I TTTTTTTTTTTTTTT I 



GO 



4. Sequence Second Round PCR Products Using Inside Primer or Q| Primer. 
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A 

-Size Selected Libraries from P. Nurese 
3-4kb 
5~6kb 
7 ~8kb 
11 - 12 kb 

-Libraries from J.A. Wise 
Sau 3a Partial Digest 
Hind III Partial Digest 

cDNA Libraries 

GAD (Gal Activation Domain) Library 
REP Library from R. Allshire 
REP81ES Library (old) 

% REP81ES Library (new) 

1 REP41ES Library 
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JSOriginal PCR H 




3' RT-PCR HH 




m + f 






500 bp 



Mg Cone. 
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A5 B2 




ill 
Q 
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5' RT PCR Strategy 



5' 



mRNA 



I AAAAAAAAAAAAAAAA . 



ddA- 



1 . Synthesis of cDNA with Specific Downstream Primer. 

mRNA 
5' ■■■■■■■■■ 

3' MHDHBHBnB 



I AAAAAAAAAAAAAAAA . 



2. Ligate Oligo with 5'-P and blocked 3* to cDNA using T4 RNA ligase. 




4. Second Round PCR 
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Alignment of RT Domains from Telomerase Catalytic Subunits. 

Motif 0 

S.p. Tezlp (429). WLYNSFIIPILQSFFYITESSDLRNRTVYFRXDIW ...(35)... 
S.C. Est2p (366). WLFRQLIPKIIQTFFYCTEISSTVT-I\ r YFRHDT\Aj ...(35)... 
E.a. pi 23 (441). WIFEDLWSLIRCFFYVTEQQKSYSKTYYYRKNIW ...(35)... 

* * * * * ★ * * * 

Motif 1 Motif 2 K 
p hh h K hR h R 
S.p. Tezlp AVIRLLPKK— NTFRLITN-LRKRF ...(61)... 
S.C. Est2p SKMRI I PKKSNNEFRI I AI PCRGAD ...(62)... 
E.a. pl23 GKLRLIPKK— TTFRPIMTFNKKIV ...(61)... 

* * * * * * -A- 

Motif 3 (A) AF 

h hDh GY h 

S.p. Tezlp K K V FVRI D I KSC YDR I KQDLMFR I VK . . . (89) . . . 

S.C. Est2p E1YFMKFDVKSCYDSIPRMECMRILK . . . (75) . . . 

E.a. pl23 K1FFATMDIEKCYDSVNREKLSTFLK . . . (107) . . . 
***** * 

Motif 4 (B' ) 

hPQG pP hh h 

S.p. Tezlp YLQKVGIPQGSILSSFLCHFYMEDLIDEYLSF ...(6)... 

S.c. Est2p YIREDGLFQGSSLSAPIVDLVYDDLLEFYSEF ...(8)... 

E.a. pl23 YKQTKG I PQGLCV S S I L S SF YYATLE ESS L-GF ...(14)... 
***** * * 

Y Motif 5(C) Motif 6(D) 
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S.p. Tezlp VI LRWDD FLF I TVNKKD AKKFLNLS LRG F E KHNF S T S L E KTV INFENS . ( 205 ) 
S.c. Est2p LILKIJ^DFLIISTIXX2QVINIKK^ .(173) 
E.a. pl23 LLMRLTDDYLLITTQENNAVLFIEKLIWSRENGFKF%JMKKLQTSFPLS . (209) 
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FIGURE 43 

Disruption strategy for the putative telomerase genes. 
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wild type telomerase gene 



1 . Transform with linear fragment 
containing the telomerase gene 
disrupted with a LEU2 or uro4 marker. 
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telomerase gene::M 
wild type telomerase gene 




2. Assay in selective media. 

3. Sporulate, and grow on 
selective media. 



wild type telomerase gene 



Wild Type (X2) 




Senescence? (X2) 



(These cells will show a senescence phenotype 

if the disrupted gene encodes a telomerase subunit.) 
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FIGURE 44 



An Example of Confirmation of tezl disruption By PCR 





FIGURE 



Tezl disruption causes progressive 
shortening of telomeres in S. pombe 
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FIGURE 46 



1 ggtaccgatttactttcctttcttcataagctaattgcttcctcgaacgctcctaaatctctggaaatacttttacaaga 80 
81 actcaataacaataccaagtcaaattccaatatgaaggtgttattagtgatcgataatatttctattttatcggtcgtta 160 
161 ccaagtataaggacaaaaagaacaacttccttccccctaaagacttttactttattaatt tact tttcaaatatatt teg 240 
241 ggttcgcttacttttaatcgtggtactgttttagctgctacttctagccaaccgcgtgtttctaccccgtcattggatat 320 
321 agctcttggagtagctcacagaaatccttacaaatcttctgatgagactatattagattcattacagtccgtgcatattc 400 
401 ttaacatggagccttacactttagatgagtcacgtcgcatgatggagtatttggtatcatccaacgtttgccttgaaaag 480 
481 gttgataattatttgcaaaatcatgt'ccttagtggtggtaatccgcgaaagttttttgatgcttgcacacgtctagcatg 560 
561 attgagatattcaaaaatttctatccactacaactcctttaacgcggttttatttttctattttctattctcatgttgtt 640 
641 ccaaatatgtatcatctcgtattaggcttttttccgttttactcctggaatcgtacctttttcactattccccctaatga 720 
721 ataatctaaattagtttcgcttataattgatagtagtagaaagattggtgattctactcgtgtaatgttattagtttaaa 800 
801 gatactttgcaaaacatttattagctatcattatataaaaaaaatcctataattataaatattaatcaatatttgcggtc 880 
881 actatt tat ttaaaacgttatgatcagtaggacactttgcatatatatagttatgcttaatggt tact tgtaacttgc 958 

959 ATG ACC GAA CAC CAT ACC CCC AAA AGO AGG ATT CTT CGC TTT CTA GAG AAT CAA TAT GTA 1018 
1MTEHHTPKS R I L R FLENQYV 20 

1019 TAC CTA TGT ACC TTA AAT GAT TAT GTA CAA CTT GTT TTG AGA GGG TCG CCG GCA AGC TCG 1078 
'■Si YLCTLNDYVQLVLRGS PASS 40 

1^79 TAT AGC AAT ATA TGC GAA CGC TTG AGA AGC GAT GTA CAA ACG TCC TTT TCT ATT TTT CTT 1138 
;%41 YSNICERLRSDVQTSFSIFL 60 



14:39 CAT TCG ACT GTA GTC GGC TTC GAC AGT AAG CCA GAT GAA GGT GTT CAA TTT TCT TCT CCA 
^61 HSTVVGFDSKPDEGVQFSSP 



80 



1198 



FIGURE 46 (cont.) 

1199 AAA TGC TCA CAG TCA GAG gtatatatatttttgttttgatttttttctattcgggatagctaatatatgggcag 1272 

81 K C S Q S E 86 

1273 CTA ATA GCG AAT GTT GTA AAA CAG ATG TTC GAT GAA AGT TTT GAG CGT CGA AGG AAT CTA 1332 

87LIANV VKQMFDESFERRRNL 106 

im ttg ATG AAA GGG TTT TCC ATG gtaaggtattctaattgtgaaatatttacctgcaattactgtttcaaagaga 1405 
107 L M K G F S M 

1406 ttgtatttaaccgataaag AAT CAT GAA GAT TTT CGA GCC ATG CAT GTA AAC GGA GTA CAA AAT 1469 



114 



NHEDFRAMHVNGVQN 128 



147 0 GAT CTC GTT TCT ACT TTT CCT AAT TAC CTT ATA TCT ATA CTT GAG TCA AAA AAT TGG CAA 1529 
129 DLVSTFPNYL ISI LESKNWQ 148 
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2337 TTC TTG AAA TTA TCG AGA TAG GAG TCT TTT AGT TTA CAT TAT TTA ATG AGT AAC ATA AAG 23 96 

376 FLKLSRYESFSLHYLMSNIK 395 

2397 gtaatatgccaaatttttttaccattaattaacaatcag ATT TCA GAA ATT GAA TGG CTA GTC CTT GGA 246S 

396 ISEIEWLVLG 405 

2466 AAA AGG TCA AAT GCG AAA ATG TGC TTA AGT GAT TTT GAG AAA CGC AAG CAA ATA TTT GCG 2525 

406 KRSNAKMCLSDFEKRKQIFA 425 

2526 GAA TTC ATC TAC TGG CTA TAC AAT TCG TTT ATA ATA CCT ATT TTA CAA TCT TTT TTT TAT 2585 

426 EFIYWLYNSFI I PI LQSFFY 445 

2586 ATC ACT GAA TCA AGT GAT TTA CGA AAT CGA ACT GTT TAT TTT AGA AAA GAT ATT TGG AAA 264 5 

446 I TESSDLRNRTVYFRKDIWK 465 

2646 CTC TTG TGC CGA CCC TTT ATT ACA TCA ATG AAA ATG GAA GCG TTT GAA AAA -ATA AAC GAG 2705 

466 LLCRPFITSMKMEAFEKINE 485 

, :?f:v 2706 gtattttaaagtattttttgcaaaaagctaatattttcag AAC AAT GTT AGG ATG GAT ACT CAG AAA ACT 2775 

"J 486 NNVRMDTQKT 495 

22776 ACT TTG CCT CCA GCA GTT ATT CGT CTA TTA CCT AAG AAG AAT ACC TTT CGT CTC ATT ACG 2835 

f " 496 TLPPAVIRLLPKKNTFRL I T 515 

f!2836 AAT TTA AGA AAA AGA TTC TTA ATA AAG gtattaatttttggtcatcaatgtactttact tctaatctatta 2906 

2 516 N L R K R F L I K 524 

2907 ttagcag ATG GGT TCA AAC AAA AAA ATG TTA GTC AGT ACG AAC CAA ACT TTA CGA CCT GTG 2967 

3 525 MGSNKKMLVS TNQTLR PV 542 

2968 GCA TCG ATA CTG AAA CAT TTA ATC AAT GAA GAA AGT AGT GGT ATT CCA TTT AAC TTG GAG 3027 

F ! 543 ASILKHLINEESSGIPFNLE 562 

I 3028 GTT TAC ATG AAG CTT CTT ACT TTT AAG AAG GAT CTT CTT AAG CAC CGA ATG TTT GG gtaat 3088 

563 VYMKLLTFKKDLLKHRMFG 581 

3089 tatataatgcgcgattcctcattattaattttgcag G CGT AAG AAG TAT TTT GTA CGG ATA GAT ATA 3155 

582 RKKYFVRIDI 591 

3156 AAA TCC TGT TAT GAT CGA ATA AAG CAA GAT TTG ATG TTT CGG ATT GTT AAA AAG AAA CTC 3215 

592 KSCYDRIKQDLMFRIVKKKL 611 

3216 AAG GAT CCC GAA TTT GTA ATT CGA AAG TAT GCA ACC ATA CAT GCA ACA AGT GAC CGA GCT 327 5 

612 KDPEFVIRKYATIHATSDRA 631 

3276 ACA AAA AAC TTT GTT AGT GAG GCG TTT TCC TAT T gtaagtttattttttcattggaattttttaacaa 3 34 3 

632 TKNFVSEAFSYF 643 

3344 attcttttttag TT GAT ATG GTG CCT TTT GAA AAA GTC GTG CAG TTA CTT TCT ATG AAA ACA 340S 

644 DMVPFEKVVQLLSMKT 659 

3406 TCA GAT ACT TTG TTT GTT GAT TTT GTG GAT TAT TGG ACC AAA AGT TCT TCT GAA ATT TTT 34 65 

660 SDTLFVDFVDYWTKSSSE I F 679 



3466 AAA ATG CTC AAG GAA CAT CTC TCT GGA CAC ATT GTT AAG gtataccaa ttgt tgaattgtaataaca 3 532 
680 KMLKEHLSGHIVK 692 



FIGURE 46 (cont) 

3533 ctaatgaaactag ATA GGA AAT TCT CAA TAC CTT CAA AAA GTT GGT ATC CCT CAG GGC TCA 3593 

693 IGNSQYLQKVGIPQGS 708 

3594 ATT CTG TCA TCT TTT TTG TGT CAT TTC TAT ATG GAA GAT TTG ATT GAT GAA TAC CTA TCG 3 653 

709 I LSSFLCHFYMEDLIDEYLS 728 

3 654 TTT ACG AAA AAG AAA GGA TCA GTG TTG TTA CGA GTA GTC GAC GAT TTC CTC TTT ATA ACA 3713 
729 FTKKKGSVLLRVVDDFLFIT 748 

3714 GTT AAT AAA AAG GAT GCA AAA AAA TTT TTG AAT TTA TCT TTA AGA G gtgagt tgctgtca t tec 3777 

749 VNKKDAKKFLNLSLRG 764 

3778 taagttctaaccgttgaag GA TTT GAG AAA CAC AAT TTT TCT ACG AGC CTG GAG AAA ACA GTA 3840 

765 FEKHNFSTSLEKTV 778 

3841 ATA AAC TTT GAA AAT AGT AAT GGG ATA ATA AAC AAT ACT TTT TTT AAT GAA AGC AAG AAA 3900 

779 INFENSNGI I NNTFFNESK K 798 

3901 AGA ATG CCA TTC TTC GGT TTC TCT GTG AAC ATG AGG TCT CTT GAT ACA TTG TTA GCA TGT 3960 

Q799 RMPFFGFSVNMRSLDTLLAC 818 

|f 9 61 CCT AAA ATT GAT GAA GCC TTA TTT AAC TCT ACA TCT GTA GAG CTG ACG AAA CAT ATG GGG 4020 

,£819 PKIDEALFNSTSVELTKHMG 838 

m021 AAA TCT TTT TTT TAC AAA ATT CTA AG gtatactgtgtaactgaataatagctgacaaataatcag A TCG 4089 

y839 KSFFYKILR S 848 

"4 090 AGC CTT GCA TCC TTT GCA CAA GTA TTT ATT GAC ATT ACC CAC AAT TCA AAA TTC AAT TCT 4149 

n S49 SLASFAQVFIDITHNSKFNS 868 

^150 TGC TGC AAT ATA TAT AGG CTA GGA TAC TCT ATG TGT ATG AGA GCA CAA GCA TAC TTA AAA 4209 

^=2 869 CCNIYRLGY SMCMRAQAYLK 888 

l i! %2!0 AGG ATG AAG GAT ATA TTT ATT CCC CAA AGA ATG TTC ATA ACG G gtgagtacttat tttaactaga 4274 

889 RMKDIFIPQRMFITD 903 

4275 aaagtcattaattaaccttag AT CTT TTG AAT GTT ATT GGA AGA AAA ATT TGG AAA AAG TTG GCC 4339 

904 LLNVIGRKIWKKLA 917 

4340 GAA ATA TTA jGGA TAT ACG AGT AGG CGT TTC TTG TCC TCT GCA GAA GTC AAA TG gtacgtgtc 4401 

918 EILGYTSRRFLSSAEVKW 935 

4402 ggtctcgagacttcagcaatattgacacatcag G CTT TTT TGT CTT GGA ATG AGA gat GGT TTG AAA 4468 

936 LFCLGMRDGLK 946 

4469 CCC TCT TTC AAA TAT CAT CCA TGC TTC GAA CAG CTA ATA TAC CAA TTT CAG TCA TTG ACT 4528 

947 PSFKYHPCFEQLIYQFQSLT 966 

4 529 GAT CTT ATC AAG CCG CTA AGA CCA GTT TTG CGA CAG GTG TTA TTT TTA CAT AGA AGA ATA 4588 
967 DLIKPLRPVLRQVLFLHRRI 986 



4589 GCT GAT TAA tgtcattttcaatttattatatacatcctt tat tactggtgtcttaaacaatat tattactaagtata 4665 
987 A D * 989 
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4666 gctgacccccaaagcaagcatactataggatttctagtaaagtaaaattaatctcgttattagttttgattgacttgtct 474 5 
474 6 ttatccttatacttttaagaaagattgacagtggttgctgactactgcccacatgcccattaaacgggagtggttaaaca 4825 
4826 ttaaaagtaatacatgaggctaatctcctttcatttagaataaggaaagtggttttctataatgaataatgcccgcacta 4905 

4906 atgcaaaaagacgaagattatcttctaaacaagggggattaagcatatccgaaggaaaagagagtaatatacccagtgtt 4 985 
4986 gt tgaagaaagcaaggataatttggaacaagcttctgcagatgacaggctaaattttggtgaccgaattttggtaaaagc 5065 
5066 cccaggttatccatggtggccggccttgctactgagacgaaaagaaactaaggatagtttgaatactaatagctcattta 5145 
514 6 atgtcttatataaggttttgttttttcctgacttcaattttgcatgggtgaaaagaaatagtgttaagccattattggat 522 5 
522 6 tccgaaatagccaaatttcttggttcctcaaagcggaagtctaaagaacttattgaagcttatgaggcttcaaaaactcc 5305 
5306 tcctgatttaaaggaggaatcttccaccgatgaggaaatggatagcttatcagctgctgaggagaagcctaattttttgc 5385 
538 6 aaaaaagaaaatatcattgggagacatctcttgatgaatcagatgcggagagtatctccagcggatccttgatgtcaata 54 65 
54 66 acttctatttctgaaatgtatggtcctactgtcgcttcgacttctcgtagctctacgcagttaagtgaccaaaggtacc 5544 



FIGURE 47 



1 

met ser val tyr val val glu leu leu 
GC CAAGTT CCTG CACTGGCTG ATG AGT GTG TAC GTC GTC GAG CTG CTC 



arg ser phe phe tyr val thr glu thr thr phe gin lys asn arg 

AGG TCT TTC TTT TAT GTC ACG GAG ACC ACG TTT CAA AAG AAC AGG 

30 

leu phe phe tyr arg lys ser val trp ser lys leu gin ser ile 

CTC TTT TTC TAC CGG AAG AGT GTC TGG AGC AAG TTG CAA AGC ATT 

40 50 

gly ile arg gin his leu lys arg val gin leu arg glu leu ser 

GGA ATC AGA CAG CAC TTG AAG AGG GTG CAG CTG CGG GAG CTG TCG 

GO 

glu ala glu val arg gin his arg glu ala arg pro ala leu leu 

GAA GCA GAG GTC AGG CAG CAT CGG GAA GCC AGG CCC GCC CTG CTG 

70 80 

thr ser arg leu arg phe ile pro lys pro asp gly leu arg pro 

ACG TCC AGA CTC CGC TTC ATC CCC AAG CCT GAC GGG CTG CGG CCG 

90 

ile val asn met asp tyr val val gly ala arg thr phe arg arg 

ATT GTG AAC ATG GAC TAC GTC GTG GGA GCC AGA ACG TTC CGC AGA 

100 110 

glu lys ala glu arg leu thr ser arg val lys ala leu phe 

GAA AAG ARG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA CTG TTC 

120 

ser val leu asn tyr glu arg ala arg arg pro gly leu leu gly 

AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC GGC CTC CTG GGC 

130 140 

ala ser val leu gly leu asp asp ile his arg ala trp arg thr 
GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC TGG CGC ACC 

150 

phe val leu arg val arg ala gin asp pro pro pro glu leu tyr 
TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG CTG TAC 

160 170 

phe val lys val asp val thr gly ala tyr asp thr ile pro gin 
TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC CAG 



10 



20 



180 

asp arg leu thr glu val ile ala ser ile ile lys pro gin asn 
GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG AAC 
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190 



200 



thr tyr cys val arg arg tyr ala val val gin lys ala ala met 
ACG TAC TGC GTG CGT CGG TAT GCC GTG GTC CAG AAG GCC GCC ATG 

210 

gly thr ser ala arg pro ser arg ala thr ser tyr val gin cys 
GGC ACG TCC GCA AGG CCT TCA AGA GCC ACG TCC TAC GTC CAG TGC 

220 230 

gin gly ile pro gin gly ser ile leu ser thr leu leu cys ser 
CAG GGG ATC CCG CAG GGC TCC ATC CTC TCC ACG CTG CTC TGC AGC 

240 

leu cys tyr gly asp met glu asn lys leu phe ala gly ile arg 
CTG TGC TAC GGC GAC ATG GAG AAC AAG CTG TTT GCG GGG ATT" CGG 

250 260 

arg asp gly leu leu leu arg leu val asp asp phe leu leu val 
CGG GAC GGG CTG CTC CTG CGT TTG GTG GAT GAT TTC TTG TTG GTG 

270 

thr pro his leu thr his ala lys thr phe leu arg thr leu val 
ACA CCT CAC CTC ACC CAC GCG AAA ACC TTC CTC AGG ACC CTG GTC 

280 290 

arg gly val pro glu tyr gly cys val val asn leu arg lys thr 
CGA GGT GTC CCT GAG TAT GGC TGC GTG GTG AAC TTG CGG AAG ACA 

300 

val val asn phe pro val glu asp glu ala leu gly gly thr ala 
GTG GTG AAC TTC CCT GTA GAA GAC GAG GCC CTG GGT GGC ACG GCT 

310 320 

phe val gin met pro ala his gly leu phe pro trp cys gly leu 
TTT GTT CAG ATG CCG GCC CAC GGC CTA TTC CCC TGG TGC GGC CTG 

330 

leu leu asp thr arg thr leu glu val gin ser asp tyr ser ser 
CTG CTG GAT ACC CGG ACC CTG GAG GTG CAG AGC GAC TAC TCC AGC 

340 350 

tyr ala arg thr ser ile arg ala ser leu thr phe asn arg gly 
TAT GCC CGG ACC TCC ATC AGA GCC AGT CTC ACC TTC AAC CGC GGC 

360 

phe lys ala gly arg asn met arg arg lys leu phe gly val leu 
TTC AAG GCT GGG AGG AAC ATG CGT CGC AAA CTC TTT GGG GTC TTG 



370 380 

arg leu lys cys his ser leu phe leu asp leu gin val asn ser 
CGG CTG AAG TGT CAC AGC CTG TTT CTG GAT TTG CAG GTG AAC AGC 



m 
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390 

leu gin thr val cys thr asn ile tyr lys ile leu leu leu gin 
CTC CAG ACG GTG TGC ACC AAC ATC TAC AAG ATC CTC CTG CTG CAG 



ala tyr arg phe his ala cys val leu gin leu pro phe his gin 
GCG TAC AGG TTT CAC GCA TGT GTG CTG CAG CTC CCA TTT CAT CAG 

420 

gin val trp lys asn pro his phe ser cys ala ser ser leu thr 
CAA GTT TGG AAG AAC CCA CAT TTT TCC TGC GCG TCA TCT CTG ACA 

430 440 

arg leu pro leu leu leu his pro glu ser gin glu arg arg asp 
CGG CTC CCT CTG CTA CTC CAT CCT GAA AGC CAA GAA CGC AGG .GAT 

450 

val ala gly gly gin gly arg arg arg pro ser ala leu arg gly 
GTC GCT GGG GGC CAA GGG CGC CGC CGG CCC TCT GCC CTC CGA GGC 

460 470 

arg ala val ala val pro pro ser ile pro ala gin ala asp ser 

CGT GCA GTG GCT GTG CCA CCA AGC ATT CCT GCT CAA GCT GAC TCG 

480 

thr pro cys his leu arg ala thr pro gly val thr gin asp ser 
ACA CCG TGT CAC CTA CGT GCC ACT CCT GGG GTC ACT CAG GAC AGC 

490 500 

pro asp ala ala glu ser glu ala pro gly asp asp ala asp cys 
CCA GAC GCA GCT GAG TCG GAA GCT CCC GGG GAC GAC GCT GAC TGC 

510 

pro gly gly arg ser gin pro gly thr ala leu arg leu gin asp 
CCT GGA GGC CGC AGC CAA CCC GGC ACT GCC CTC AGA CTT CAA GAC 

520 530 

his pro gly leu met ala thr arg pro gin pro gly arg glu gin 
CAT CCT GGA CTG ATG GCC ACC CGC CCA CAG CCA GGC CGA GAG CAG 

540 

thr pro ala ala leu ser arg arg ala tyr thr ser gin gly gly 
ACA CCA GCA GCC CTG TCA CGC CGG GCT TAT ACG TCC CAG GGA GGG 

550 560 

ar 9 giy giy P ro his P ro giy i eu his ar 9 tr P gi u ser g iu aia 

AGG GGC GGC CCA CAC CCA GGC CTG CAC CGC TGG GAG TCT GAG GCC 

564 
OP 

TGA GTGAGTGTTTGGCCGAGGCCTGCATGTCCGGCTGAAGGCTGAGTGTCCGGCTGAGGC 



400 



410 



CTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTCCAGCACACCTGCGTTTTCACTTCCCCAC 
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AGGCTGGCGTTCGGTCCACCCCAGGGCCAGCTTTTCCTCACCAGGAGCCCGGCTTCCACT 
CCCCACATAGGAATAGTCCATCCCCAGATTCGCCATTGTTCACCCTTCGCCCTGCCTTCC 
TTTGCCTTCCACCCCCACCATTCAGGTGGAGACCCTGAGAAGGAGCCTGGGAGCTTTGGG 
AATTTGGAGTGACCAAAGGTGTGCCCTGTACACAGGCGAGGACCCTGCACCTGGATGGGG 
GTCCCTGTGGGTCAAATTGGGGGGAGGTGCTGTGGGAGTAAAATACTGAATATATGAGTT 
TTTCAGTTTTGGAAAAAAAAAAAAAAAAAAAAAAAAAA 
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Motif -1 

Ep pi 23 ...LVVSLIRCFFYVTEQQKSYSKT... 

Sp Tezl ...FIIPILQSFFYITESSDLRNRT... 

Sc Est2 ...LIPKIIQTFFYCTEISSTVTIV... 

HsTCPl ...YVVELLRSFFYVTETTFQKNRL... 
consensus FFY TE 

K 

Motif 0 phhh K hRh R 

Ep pl23 . . . KSLGF APGKLRLI PKKT--TFRPIMTFNKKI V . . . 

Sp Tezl . . . QKTTLPP A VIRLLPKJCN--TFRLITNLRKRFL. . . 

Sc Est2 ...TLSNFNHSKMRIIPKKSNNEFRIIAIPCRGAD... 

Hs TCP1 ...ARPALLTSRLRFIPKPD-GLRPIVNMDYVVG.. 
consensus R PK R I 

AF 

Motif A h hDh GY h 

Ep pi 23 . . . PKLFFATMDIEKC YDS VNREKLSTFLK. . . 

Sp Tezl ...RKKYFVRIDIKSCYDRIKQDLMFRJVK... 

Sc Est2 ...PELYFMKFDVKSCYDSIPRMECMRILK... 

Hs TCP1 . . . PEL YFVK VD VTGA YDTIPQDRLTEV1 A.. J/... 
consensus F D YD 

Motif B hPQG pShh 

Ep pl23 . . .NGKF YKQTKGIPQGLC VSSILSSF Y Y A. . . 

Sp Tezl ...GNSQYLQKVGIPQGSILSSFLCHFYME... 

Sc Est2 . . . EDKC YIREDGLFQGS SLS AP I VDL V YD . . . 

Hs TCP1 ...RATSYVQCQGIPQGSILSTLLCSLCYG... 
consensus G QG S 

Y 

Motif C h FDDhhh 

Eppl23 ...PNVNLLMRLTDDYLLITTQENN... 

Sp Tezl ...KKGSVLLRVVDDFLFITVNKKD... 

Sc Est2 ...SQDTLILKLADDFLIISTDQQQ... 

HsTCPl ...RRDGLLLRLVDDFLLVTPHLTH... 
consensus DD L 

Motif D Ghh cK 

Eppl23 ...NVSRENGFKFNMKKL... 

Sp Tezl ...LNLSLRGFEKHNFST... 

Sc Est2 ...KKLAMGGFQKYNAKA... 

HsTCPl ...LRTLVRGVPEYGCVV... 
consensus G 
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FIGURE 50 



1 GCAGCGCTGC GTCCTGCTGC GCACGTGGGA AGCCCTGGCC CCGGCCACCC 
5 1 CCGCGATGCC GCGCGCTCCC CGCTGCCGAG CCGTGCGCTC CCTGCTGCGC 
1 0 1 AGCC ACTACC GCGAGGTGCT GCCGCTGGCC ACGTTCGTGC GGCGCCTGGG 
1 5 1 GCCCCAGGGC TGGCGGCTGG TGCAGCGCGG GGACCCGGCG GCTTTCCGCG 
201 CGNTGGTGGC CCANTGCNTG GTGTGCGTGC CCTGGGANGN ANGGCNGCCC 
25 1 CCCGCCGCCC CCTCCTTCCG CCAGGTGTCC TGCCTGAANG ANCTGGTGGC 
301 CCGAGTGCTG CANANGCTGT GCGANCGCGG CGCGAANAAC GTGCTGGCCT 
3 5 1 TCGGCTTCGC GCTGCTGGAC GGGGCCCGCG GGGGCCCCCC CGAGGCCTTC 
40 1 ACC ACCAGCG TGCGCAGCTA CCTGCCCAAC ACGGTGACCG ACGCACTGCG 
45 1 GGGGAGCGGG GCGTGGGGGC TGCTGCTGCG CCGCGTGGGC GACGACGTGC 
50 1 TGGTTCACCT GCTGGCACGC TGCGCGNTNT TTGTGCTGGT GGNTCCCAGC 
55 1 TGCGCCTACC ANGTGTGCGG GCCGCCGCTG TACCAGCTCG GCGCTGCNAC 
601 TCAGGCCCGG CCCCCGCCAC ACGCTANTGG ACCCGAANGC GTCTGGGATC 
65 1 CAACGGGCCT GGAACCATAG CGTCAGGGAG GCCGGGGTCC CCCTGGGCTG 
70 1 CC AGCCCCGG GTGCGAGGAG GCGCGGGGGC AGTGCCAGCC G AAGTCTGCC 
75 1 GTTGCCCAAG AGGCCCAGGC GTGGCGCTGC CCCTGAGCCG GAGCGGACGC 
80 1 CCGTTGGGCA GGGGTCCTGG GCCC ACCCGG GCAGGACGCC TGGACCGAGT 
85 1 GACCGTGGTT TCTGTGTGGT GTCACCTGCC AGACCCGCCG AAGAAGCCAC 
90 1 CTCTTTGGAG GGTGCGCTCT CTGGCACGCG CCACTCCC AC CCATCCGTGG 
95 1 GCCGCCAGCA CCACGCGGGC CCCCC ATCCA CATCGCGGCC ACCACGTCCT 
1001 GGGACACGCC TTGTCCCCCG GTGTACGCCG AGACCAAGCA CTTCCTCTAC 
1 05 1 TCCTCAGGCG AC AAGNACAC TGCGNCCCTC CTTCCTACTC AATATATCTG 
1101 AGGCCC AGCC TG ACTGGCGT TCGGGAGGTT CGTGGAGACA NTCTTTCTGG 
1151 TTCC AGGCCT TGGATGCCAG GATTCCCCGC AGGTTGCCCC GCCTGCCCC A 
1201 GCGNTACTGG C AAATGCGGC CCCTGTTTCT GGAGCTGCTT GGGAACC ACG 
1251 CGCAGTGCCC CTACGGGGTG TTCCTCAAGA CGCACTGCCC GCTGCGAGCT 
1301 GCGGTCACCC CAGCAGCCGG TGTCTGTGCC CGGGAGAAGC CCCAGGGCTC 
1351 TGTGGCGGCC CCCGAGGAGG AGGAACACAG ACCCCCGTCG CCTGGTGCAG 
1401 CTGCTCCGCC AGCACAGCAG CCCCTGGCAG GTGTACGGCT TCGTGCGGGC 
1451 CTGCCTGCGC CGGCTGGTGC CCCCAGGCCT CTGGGGCTCC AGGCACAACG 
1501 AACGCCGCTT CCTCAGGAAC ACCAAGAAGT TCATCTCCCT GGGGAAGCAT 
1551 GCCAAGCTCT CGCTGCAGGA GCTGACGTGG AAGATGAGCG TGCGGGACTG 
1 60 1 CGCTTGGCTG CGC AGGAGCC CAGGGGTTGG CTGTGTTCCG GCCGCAGAGC 
1 65 1 ACCGTCTGCG TGAGG AGATC CTGGCCAAGT TCCTGCACTG GCTGATGAGT 
1701 GTGTACGTCG TCGAGCTGCT C AGGTCTTTC TTTTATGTCA CGGAGACC AC 
1 75 1 GTTTCAAAAG AAC AGGCTCT TTTTCTACCG GAAGAGTGTC TGGAGC AAGT 
1801 TGC AAAGC AT TGGAATCAGA C AGC ACTTG A AGAGGGTGCA GCTGCGGGAG 
1851 CTGTCGGAAG CAGAGGTCAG GCAGCATCGG GAAGCC AGGC CCGCCCTGCT 
1901 GACGTCCAGA CTCCGCTTCA TCCCCAAGCC TGACGGGCTG CGGCCGATTG 
1951 TGAACATGGA CTACGTCGTG GGAGCCAGAA CGTTCCGCAG AGAAAAGAGG 
200 1 GCCG AGCGTC TC ACCTCGAG GGTGAAGGCA CTGTTC AGCG TGCTCAACTA 
205 1 CGAGCGGGCG CGGCGCCCCG GCCTCCTGGG CGCCTCTGTG CTGGGCCTGG 
2101 ACGATATCCA CAGGGCCTGG CGCACCTTCG TGCTGCGTGT GCGGGCCCAG 
2151 GACCCGCCGC CTG AGCTGTA CTTTGTC AAG GTGGATGTGA CGGGCGCGTA 
220 1 CGACACC ATC CCCCAGG AC A GGCTCACGGA GGTC ATCGCC AGCATCATCA 
225 1 AACCCCAGAA CACGTACTGC GTGCGTCGGT ATGCCGTGGT CCAG AAGGCC 
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2301 GCCCATGGGC ACGTCCGCAA GGCCTTCAAG AGCCACGTCT CTACCTTGAC 
235 1 AGACCTCCAG CCGTACATGC GACAGTTCGT GGCTCACCTG CAGGANAACA 
2401 GCCCGCTGAG GGATGCCGTC GTCATCGAGC AGAGCTCCTC CCTGAATGAG 
245 1 GCCAGCAGTG GCCTCTTCGA CGTCTTCCTA CGCTTCATGT GCCACC ACGC 
250 1 CGTGCGCATC AGGGGCAAGT CCTACGTCCA GTGCC AGGGG ATCCCGCAGG 
255 1 GCTCCATCCT CTCCACGCTG CTCTGCAGCC TGTGCTACGG CGACATGGAG 
260 1 AACAAGCTGT TTGCGGGGAT TCGGCGGGAC GGGCTGCTCC TGCGTTTGGT 
2651 GGATGATTTC TTGTTGGTGA CACCTCACCT CACCCACGCG AAAACCTTCC 
270 1 TCAGGACCCT GGTCCGAGGT GTCCCTGAGT ATGGCTGCGT GGTGAACTTG 
275 1 CGGAAGACAG TGGTGAACTT CCCTGTAGAA GACGAGGCCC TGGGTGGCAC 
280 1 GGCTTTTGTT CAGATGCCGG CCCACGGCCT ATTCCCCTGG TGCGGCCTGC 
285 1 TGCTGGATAC CCGGACCCTG GAGGTGCAGA GCGACTACTC CAGCTATGCC 
290 1 CGGACCTCC A TC AGAGCCAG TCTC ACCTTC AACCGCGGCT TC AAGGCTGG 
295 1 G AGG AACATG CGTCGCAAAC TCTTTGGGGT CTTGCGGCTG AAGTGTC ACA 
3001 GCCTGTTTCT GGATTTGCAG GTGAACAGCC TCCAGACGGT GTGCACCAAC 
305 1 ATCTACAAGA TCCTCCTGCT GCAGGCGTAC AGGTTTCACG CATGTGTGCT 
3101 GCAGCTCCCA TTTC ATCAGC AAGTTTGGAA GAACCCCACA TTTTTCCTGC 
3151 GCGTCATCTC TGACACGGCC TCCCTCTGCT ACTCCATCCT GAAAGCCAAG 
3201 AACGCAGGGA TGTCGCTGGG GGCCAAGGGC GCCGCCGGCC CTCTGCCCTC 
3251 CG AGGCCGTG C AGTGGCTGT GCCACC AAGC ATTCCTGCTC AAGCTGACTC 
3301 GACACCGTGT CACCTACGTG CCACTCCTGG GGTCACTCAG GACAGCCCAG 
3351 ACGCAGCTGA GTCGGAAGCT CCCGGGGACG ACGCTGACTG CCCTGGAGGC 
340 1 CGCAGCCAAC CCGGCACTGC CCTCAGACTT CAAGACCATC CTGG ACTGAT 
345 1 GGCCACCCGC CCACAGCCAG GCCGAGAGCA GACACCAGCA GCCCTGTCAC 
3501 GCCGGGCTCT ACGTCCCAGG GAGGGAGGGG CGGCCCACAC CCAGGCCCGC 
3551 ACCGCTGGGA GTCTGAGGCC TGAGTGAGTG TTTGGCCGAG GCCTGCATGT 
3601 CCGGCTGAAG GCTGAGTGTC CGGCTGAGGC CTGAGCGAGT GTCCAGCCAA 
365 1 GGGCTGAGTG TCCAGCACAC CTGCCGTCTT C ACTTCCCCA CAGGCTGGCG 
370 1 CTCGGCTCCA CCCCAGGGCC AGCTTTTCCT CACCAGGAGC CCGGCTTCCA 
3751 CTCCCC ACAT AGGAATAGTC C ATCCCC AG A TTCGCC ATTG TTC ACCCCTC 
380 1 GCCCTGCCCT CCTTTGCCTT CCACCCCCAC CATCCAGGTG GAGACCCTGA 
385 1 GAAGGACCCT GGGAGCTCTG GGAATTTGGA GTGACCAAAG GTGTGCCCTG 
390 1 TACACAGGCG AGGACCCTGC ACCTGGATGG GGGTCCCTGT GGGTCAAATT 
3951 GGGGGGAGGT GCTGTGGGAG TAAAATACTG AATATATGAG TTTTTCAGTT 
4001 TTGAAAAAAA AAAAAAAAAA AAAAAAAAA 



FIGURE 51 



GCAGCGCTGaTTCCTCCTGCGCACG 

1 + + + 4- v + GO 

CGTCrX^ACGCAGGACGAO 

5 A A t. R P A A H V G S e C F G H F K D A 

b Q k C V L L R T W E A L A P A T P A M P - 

c 5 A A S C C A R G K P W F F. P P F R C R - 

61 v \ i i i v 120 

C<XGCGAGGGGCGAa3^ 

A R £ P I, F S R A L P A A Q P T. P R G A 
b R A P K C K A V R S* L L R £ H * K K V L - 

c ALP A A E P C A P C C A A T T A R C C - 

iX03CTGGCC^CGTira 

121 + + + + ISO 

CGGCGACCGGTGCAAGCACGtrGCGa^CCCCGGGGTCCC^ 

!;Q 3 A A Q H V R A A P G A F G T. A A G A A R 

b FLATFVRRLGFOQWRLX r QRG- 
"2 c K W p R S C 0 A W C P R A G C W C S A G - 

J J Ga^CCCGGCGGCTTTCCGCGOT^^ 

f;M 181 + + + 4- + • ■ • -- + 

I; M CCTGC^CGCCOAAAGKX^ 

/j GPGGFPR7G G P ? : »j V R A L 6 '•' 

s=s% b DP A A K H A V V A ? C ? V C V p W ? 7 - 

'% r T R R L S A R W W P ? A W C A C F G ? - 

^ ; M AWGGCNGCCCCCCGCCGCCCCXTCCTIX^^ 

if! 241 * 4 + * + + 300 

p TrKTGtfcGCGCGGCGGCGGGc^ 

? A A P P R P L L P P G V L P E :' V G G 
b *> PFAAPSFRQ V S C L ? ? L V A - 

c G ? PFFPPP5A R C p A * 7 ? W W P - 

CCOAGTGCTOa^NANGC^ 
' J ,01 + + + * + + 360 

tJ P 5 A A ? A V R ? R R E R A G L ft L R 

b R V L ? ? b C ? R G A 7 N V L A F 0 F A - 

c £ C C - C A ? A A R ? T C W P 3 A S R - 

* + + i- i + A 20 

eGAeaACtrrGcai'CG^^ 

Cf A A G K G F R G P P R G L H H 0 R A 0 L 

b L L D G A R G G F P E A F T T S V R S V 

C C W T G P A G A F F R P 5 p p A C A A T - 



FIGURE 51 (cont.) 



CCTGCCCAACAaiGlTiACCGAC^^ 
4 21 h + + - + + - + 4X0 

h P A OHGDRRTAGEPGV G A A A A 

b LPNTVTDAT. RGSGAV/GLLL.K- 

C C P T H * F T H C C C A G k G G C C C A - 

CCCCtflX^^GAOXAO 

4S1 + + , + 1 * 540 

n PRCRRRAGSPA6TLR??CA 0 - 

b KVGDDVLVHLLARC A ? r v L v - 

C AWATTCWFTCWHAAR7T-CWW. 

GGmtxxACcrccGocTA^ 

c; 41 + + + — + + + Cuu 

OCtW^TICGACGCGGA 

□ ■„ G S C> L R L P ? V R A A A V p A R F C - 

iiO b V PS C A Y 7 V C G P P L Y £ L G A A T - 

H*f| C F A APT? C A G K R C T t" A L L - 

»-f! tcmgcccggotccgc^ 

;J &m - 4 1 1 -t + 1 hf.o 

ACK'GGGGCCGGGGGO^STGTGGsl^^ 

if\ j SCPAPATP?WTP?PLGSNGP- 

f; b «j A R P P P II A ? C? P E ? V W D P T G T, - 

\ : *) C RPGFRHTT. ?DT?ASG10RAW 

tl ^^CCATAGCGTCAGCGAGGCCCCa^ 

[jf i + + 1- + t ~20 

$ 1 OITTGGTATCGCAGTXrClTrCCGGC^ 

^ :i G T 1 A S C R P C S P W A A S P G C E E - 

b h* P * R 0 C G R C P P G L P A P G A R R - 

C NHSVREAGVPL G C Q P R V R G G - 

72i + r + + +• * 7S0 

CGaXXCO^GTT^Ci^ 

A R G 0 C Q P K S A V A Q E A Q A w K C 
b XGGSASRSLPLPKRPRKG A A 

A G A V r A E V C R C F R G P G V A L ? - 

CCXTf*iACCCCCACLX?^ 
7 61 + + 1 -r i 

GC^rB^SGCCTC^ 

a F * A C A D A K W A C V L G F F G 0 D A 

b P E P E R T P V 0 0 G S W A H P C R T h - 

b h R S G RPT, GPGPGPTRACRL- 



i 



FIGURE 51 (cont.) 



TGG\VXGAGTCaCCG^ 

yd } + - + + 1 • + $00 

AGtnG<XTCACI\>X^CCAAAGA^ 

a WTE*PWFLCGVTCQTPRBSH 
b G F S D R G F C V V £ P A R P A £ E A T 

C DPVTvvsvwcHLPDPPKKP?- 

901 + + -■ - + -+ * r 9Gu 

CACAMCCICCavCGCGAC^GAC^ 

:j L F C G C A L W H A P h P P T R G F r A 

b S L E G A L S C T K H S H P $ V G R O f| - 

LWRVRSLARATPTHPWAAST- 

CCAO^^GOCCCCXA^ 

Sf.l + — •-• + + 4 4 i 1020 

aTI^GCCCGGGGGCTAGGT^ 

a PKCPPIHIAATTSWDTPCPr 
b H A G P P - l ? T S R P P R P G T R L v p f 

TRAFHPHRGHHVLGHALSPG- 

CTGTAOXX^GAOTA^ 

1021 + + + - - * - — - ~ lutiU 

CACATGCGGCT-TT^TTTC^ 

V y A t T K H p L Y S S C D K ? T A ? L 
b C T P R P S T :? 5 T P Q A T '* T L R P *~ - 

t: V k h U o A L P L L L R P. 0 H C ? P P - 

CTTCCTACTCAATAT^^ 

1031 + * + + + 1 140 

G AAQ^TG A<~rTTAT ATAG AC 1 CCGG GTCGGAC 1GACCTJCAAGCCC i CCAAGCVJCi XTP'JT 

a L P V Q Y I * G P A * L A P G K f V £ T 

b F L L N I i £ A Q P D W R £ 0 G £ W R / - 

c : S Y 1- X Y L R P S L T G V R E V R G D ■• - 

ntltttcwjTb;c^jG(:ctt^ 

1141 + t + + + t 1200 

NAGAA^avJCA>/JG"ltt^ 

-? F L V p G L C Q D S P Q V a P P A P 

h r, f w F Q A L. D A P. I P R R L F R r. P - 

L-GFR PWMP G F P A G C r A C F - 

i:ni. + * + + - + 1 .?eP 

OXNATG^COirm^CGCCX^IKAC^PA^^ 

* A I A N A A ? V S G A A W E P R A V p 

b R Y W O M R r L F L E L t. G N II A O C ? - 

C ? T G K C G F C F W S C L G T T K S A .- - 



FIGURE $1 (cont.) 



i^A'T^^TCTTGCTCAAGACGC^ 

1^1 " - ♦ + + 

«JATG^XCACAAGGAGTTCTGCGT^ 

L R V P n D A ii F A A S C G H P S S R 

b v r; V F L K T H C ? L R A A V T P A A ^ " 

T C, C - <= R R T A R C E L R S P Q Q P V - 



a 



C 



b 
c 



+ + f 1360 

" ACAGACACC^TC^ 

c L C P G F. A P G T. C G G P R « ^ G T 0 " 
V r A K E K P 0 G - u A A P E E E E H R 

5 v P G R S P R A I. w R P r P F F N T D - 

*"^"** + + + v 1440 

^"^gj^ — — — — ■ 

TC^GGCACO^CCACGTC 

T FVAWCSC£A3TJ. AP G P C T A 



\, P P £ P 



Q A A A P P A Q 0 P I. A G V P L 



l> ft A L V 0 L L R Q H S S F W Q V Y G F - 

+ + t + 1500 

AGCAO>XGGGAOT^ 

- C G P A C A G W C P Q A S G A F G T T 
*p \ G l P A P A G A P P. P L G T. Q A Q P - 
V R A C L R R L V P P G L W G S R H K E - 

^TimJf-rlCC^CAGG^^^ 

" , + + is™ 

1501 + ^ + 

NAASSGTPRSSSPWGSMPSS 
T P T. P Q E H Q E V II L P G E A C o A L 

p R F L R N T K K K I S I. G K H A K L S - 

. " x ^ + t + lfcJO 

15S1 + * 

GCGACGTCCIO^Cl^ 



r C P. S * R O R * A C G T A L C C A G A 
A A G A D V E D E R A G L R L A A <j E^ P 
L Q 



ELTWKMSVKDCAWLRRSP 



n v + + + + - ■ H-S0 

crittce^ccGACAOA^^ 

0 o L A VFKPQSTVCVRR* W r P - 
p. (-, w L C S C ft R A r S A * G D P G O V - 
G V G C V P A A £ H K T. P E E I L A K - 



FIGURE 51 (cont.) 



TCCTrt\V:TG0ClUa:i\3AGT^^ 
)bHX , , . + + -- „ 1740 

AO^AOTTCACCGAcTraOir-A^ 

i f " T G * * V C T S S S C S 0 L S F H f. 

b F A 1- A D E C V K R R A A Q V F L L C H - 

C L H W T, m 5 V Y V v ELL.RS F F Y V T - 

1741 + +- --. .+ - . - - + . + U;0 o 

GCCrCiVXTlCC^A^ 

* ft R F K R T 0 -'?• F :? T G R V S 6 A I? 
b 0 D H V f K EQALFLPEECLEQV- 

L " E T T F Q K N R L F F Y R K S V W * K L - 



TGCAAVXAlTOSMT^^ 

j ^ + + + ^ 

ACGTTTOTMcXTTA^ 



^ K -I'FSD5T-KGCSCGSCRK 
A K H W N Q T A L E E C A A A C A V c t : 
O X T G T K Q H T, K R V Q t. R K T. S F, A 



CAaAGWO.(&CAG^R^^ 

, ( f - I 4- + 

CTltTC^ACTCWTt^^ 



0 r £ c s i g k p g p p c * r v d s a 5 
k g q a a s c s 0 a r p a d v 0 t f l h 

tccccw*cc7>g^ 

' + — -.«.. ----- . - + . . + + VJW() 

a<:***>ttcoi^ 

S P L T G C G K L * T W T T S W F. P V, 

P 0 A * R A A A DCEHCLRR G s Q N 
P K P D G L R P I V N M I> Y V V G a K T 



GGinXXGO^GAAA^GAGGCXr^^^ 

GCAASX'irrcr^T^ 



R £ A E K R C P S V £ P K G * P. H C 51 A 
VPORKEGKASHLEGECTVQR 
F R R E K R A E R L T S R V K A L F S V 



•rccTCA^TAca^xYjcraxrxr.^ 

+ + 4 + . + ^ 



Z £ T T S G R G A P A S W A P L C W A W 
A O L R A G A A P K P P G R r. C A G P 0 
L N V £ R A R R P G L L G A £ •« .3 f. - 



FIGURE 51 (cont.) 



ACGATATCCACAGGXCTC&JCC^rCTTO^^ 

1*101 ■ v + - ■ + + + I'l^O 

TGCTATAG^ITTCCCGGACa^ZGTO^ 

T I o T C E» G A P S C C v C ».i F R T R R 
R Y f $ L A H L R A A C AC PGP A A 

D I H R A W R T F V L R V R A 0 D h h P - 



C1CA<JC1CTACTTTGTCAA(X>^ 
-• m + + + , + 4 222(l 



T. £CTL.£RWM* RAPTTRHFRT 
* A V L C Q G G C D G R V R H H P P G O 
£ L Y F V K V D V T G A Y D T 1 L J o d K - 

GCCl^CGGAGGTCATCG^ 

2^1 ♦ + -- + + , , 22v0 

CO^A^rx^lCCAGTAGCGGTCGTAijTAGTTr^ 

GSRKSSPASSNPRTFTAC V G - 
AHGGHRQHH 0 TPEHVLRASV- 
L T £ V IAS I IK P Q K T Y C V R R y - 



•:!fil 



ATCGCGTOTOCAGAACXXX^ 
TA£GOCACraGGTCmX^ 

MPWSRRFPMGTSAfcPSRATS 
CRGP£CRPWAftP0GL0EPRL- 
A V V O K A A H G H V R k A F K S H V £ 

CTACCrraCAGACVIC^ 

« 41 + * . ?-ino 

^TGGAACTGTCTGGAGGTCC^ 

L P - 0 T S S R TODS S W L T C R ? T 
Y L D n P F A V H A T V R G £ P A G ? 0 - 
T I, T D L 0 P Y M K 0 V V/ A H L 0 > N S' - 



GCCCGCTC^XGATCCCGTOjPCATO 

+ + ► + + + 

Or^GCGACnm^AOXX^^ 



A R * G M P 5 S* 5 S R A P P * m R P A v 
P A E G C R R H R A E L L P E * G Q 0 W - 
PLRDAVVI EQSSSLNEAS 5 G - 

GCXTfCTIXXlACCTCITCCTACtX^ 

;«3.;a + + + 1 , A -js:u 

O^AG?^GCm"AGAAGGATXGAAGTACAO 

A S i T S ^ Y A i C A T T P C A S G A £ 
PLRRLPTLHVPPRRAHOGVV- 
L F D V K L R F M C H H A V R 7 R G K ^ - 




FIGURE 51 (cont.) 



p T S ^ R C, 55 R P A P 5 "3 P R C n A A 

b L n F V PG DP A G L H V r.KA A L Q F - 

V V n C 0 C I P 0 £ 1 L "J T LLC S T. - 

TG^XTACGft'GACATGGAGMCAAGCT^^^ 
:Sbl + ___ _ + 2 f>40 

ACACa^iCCOCICTACCTCm 

5 C A T A T W k T S C L R C F C G T C C S 

b V L R R H G E 0 A V C G D 5 A G F A A F - 

c C YGDMKWKLfAG I RRDCLLL- 

TCCGTITCGIY^AI^^^^^ 

-i + + + + 27oo 

ACGCAAACC a £CTACTA^ 

Q 5 C V W W M T S C W • H L T £ P T K K P S 

; ' J, b A PG»"t t f t.vgdts r h trenlp 

^ r; K L V 0 D F L L V T P H L T H A K T t* L - 

raGGAccei^eecA^ 

t 1 1 + + 1- 27 to 

^ :i S C P W 3 E V 5 L S M A A W * T C G F 0 

b QDPGPRCP *VWT. EGELA F. - "D S - 

^. - FTLVRGVPF. YGCVVNLRKTV 

?F ICGTCAACTTCCCTXjTA.GAAGACGAG^ 

|p ACCACTTGAAG<oGAC^^ 

|T a W r T 5 f, " K T K P W '«/ A K L. L. K R C K 

r "~ h G E L P C R R R C P C W H C F C £ D A rj - 

c V N r P V E D E A T. Q G T A F V Q M P A - 

CCC^JGCtTArTC^ 

idl'l + + + + + i 

u P T A Y S P G A ACCWIPCPWR 0 R 

b P fc ? I P L V R p A A G Y P D P G G A £ - 

r. H L F P W C G L L L D T R T I, E V Q S - 

GCGACrACFJCACCTATC^^^ 
j 0 h 1 + +■ 1 + * + £940 

a A T T P A M P G P P S E P V S P £ T A A 

P. L L Q L r f D L H 0 k: Q S II L 0 P P L - 
<: DYSSYARTSIRASl. TFNP.GP- 



FIGURE 51 (ctfnt.) 



C 



AOTn^X<ACCCTCCTTCTC^ 



h S F L G G T C V A N i' L G ^ C G * S V T 

b Q G W £ E H A &t?TLWGLAAE V 0 

c K A G R N M R R K I. F G V T. Ft T, K C H 



GCC^Tm^CATTTC^^ 
CGGACA^GACCTAAACGTCCACITGICGGA^ 



-3 A C F W I C R ' T A S K K C A if* 1" ^' T K 

h i> V ^ C F A G E 0 P P D <3 V H Q H L Q P 

t - : lFLDLOVNST. QTVCTNTYKT 

2061 * + - - - ■+ -. - + - - 

ACCAC^XEAOimrG^^ 



a S S C C R R T G F T H V C C if S H b* 1 S 

b r T A A G V o V S R M C A A API? ^ A 

L h t. Q A Y R F H A C V L O L P f H 0 0 



A^TITiraAAGAACrc^ 
n . . ._ + < H t ► jlo'G 

TTCAAACCriTtTlTCGGGTCTAAAAJ^ 

K , f G R T P M F S C A $ S L- T r< P F S A - 
5LEEPHIFPA R H T» * HGLPL. K 
VWKNPTFFLP. VISD T A £ L L Y 

ACTOZA'im^AMO^^^ 

+ + + * ^24n 

TCAo^^criictxrriOT^^ 

T P L" * K F R 7 Q G C F: W <; P E A F P A 
L H F K S 0 F R R D V A G G O G R R R P 
SILK A K N A G M S L C A K C A A G F - 



L t; ? p P p C 5 G C A T K H 5 C S 5 * T. 
S A I. P- G R A V A V P P 3 I P A Q A D S 
L P S E A V Q W L C HQAFLLKLTF: 



GACACCtrPTPCAOCTAOim^^ 

___ .+ + + + - -■ - • + --- — 



p T V S P T C H S W G H P G Q F P P S * 
T P C H 1 R - a T T G V T 0 D S P 0 A A E 
H R V T V v F L L G 5 L F T A 0 T Q T- 51 



FIGURE* 51 (cbnt.) 



GTo:/iWiCica:^G>3AO^ 
m^i - + + + 1 ;i4>(i 

CArXClTCaAGGGCCCCTGCTGC^^^ 

V G f F R G R ^ * I. T W p P Q r t R II C 
S E A F G D D * D C P G G R S Q p T A - 

R K L ? G T T L T A L E A A A N P A L ? - 

CCTCAGACnCAViACCAltCI^GACT 
M>1 t t i + + - .Mb'U 

P Q T £ ? r £ W T 0 G H P P T =>. R P R A 
L R L 0 U H P G L M A T P. P Q P G R R \ : - 

o [i p K T I L D * W P P A H S U A h I K - 



i * + • - - - + i 1 

CTCT^TCGTLTO^^ 



D T £ S P V T P G £ T S Q G G K G G P H 
TFAALSRRALKPKEGGAAHT- 
H 0 Q P C H A G L Y V P G R F. G P. P T P * 

(XAG^CCGCACCGCra^ 



541 



00TCO3GGCGTGGaiACa^^^ 

P G P H R W E 5 E A * V £ V W P K P A C 
tjAPTAG£LRPK*V7FCRGLH V 
~k P A P L G V * G L S E C r , A E A C M S 



GGCCG^CTTCCGACit^W 



p A E G * V S G * G L £ L C P A K C * V 
R L K R C P A h A * A 5" V Q P R A £ 0 

G * F L s V R L R P E F: V S ? 0 G L £ v - 

^GAGOrACCTGCO^TCTTCAC 

, U1 + + _— h 2-:-o 

AGGTCGTGTGGACGG^X;AAJ^ 

5 S r P A V F T G P 0 A G A R t. H P R A 
p A H L P 5 £ L P H R L A L G £ T P G P - 
0 H T C P. T- H F P T G W p. S A P P Q G 0 - 

AGCPnTal'TCACCAi^ 
o + + ♦ + . . - - + + J.78U 



TCGAAJvYSGAGTCGTO^^ 

S F S % P G A ft L P L P T ' E ' £ T P P 
A P ? H 0 E P G F H £ P H R N £ P £ P 
T, F I T K £ P A S T P H I «i I V H f Q I 



FIGURE 51 (cont.) 



^ i - + - — + 1 • 



f a : v n p p c p r l r s t p t i v v 

5 t L F T P R P A L L C L P F P P ? R >• 

r .- "ii c z p l c l p f a r h ? n n p - : 



... + * * — + - *■ 



r T _ K R T L G /\ L £ T W S D 0 F. V A L 
K t • E ij P W E T . W E F G V T K G V P 
r. ? C V" D r G S 3 C N L £ " P K V 0 p V 



V T ^ F Pj P A P G W G S T. W V K L G t; 

0 A H T L H L 0 G G I* C G S N W >, P. V 
H R R G P C T W M C V P V .J n 1 G G K C 



35 f.i + + + + 1 , 4f 

O'jACACcXTtlATTTTATVJACTT^^ 

A V G V K Y * I Y E F F £ F E K K K K K 
LWE'NTEYM5FS V L K K K K K K 
CGSKTt.NI r V F Q F ' K K K K K K - 

AAAAAaAaA 

qf\21 40^' 

TITl ' lTl ' IT 

K K K 
K K 

K K 



FIGURE 52 



BSPHl 6951, 
APAL1 6733, 



ESPH1 5943 



APAL1 



DRA3 22 5 

.XH01 6 74 
'ECOR5 7 01 
ECOR1 7 07 
r NOTl 713 
MSC1 S52 

SAC 2 1102 



SMA1 1435 
SMA1 1555 



ECOR1 4 771 
NOT1 4763 
SAC2 4 7 52 
NOT1 47 43 




SAC2 4817 
NOT1 4807 
BAMHl 4789 
SMA1 4733 



SMA1 2 061 



MSC1 24 03 



XHOl 2746 
ECOR5 2 334 



MSC1 4181 
SMAI 4102 

APAL1 3772 
SAC2 3 664 



DRA3 3227 
BAMHl 3270 



FIGURE 53 



1 

met 

GCAGCGCTGCGTCCTGCTGCGCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCG ATG 

10 

pro arg ala pro arg cys arg ala val arg ser leu leu arg ser 
CCG CGC GCT CCC CGC TGC CGA GCC GTG CGC TCC CTG CTG CGC AGC 

20 30 
his tyr arg glu val leu pro leu ala thr phe val arg arg leu 
CAC TAG CGC GAG GTG CTG CCG CTG GCC ACG TTC GTG CGG CGC CTG 

40 

gly pro gin gly trp arg leu val gin arg gly asp pro ala ala 
GGG CCC CAG GGC TGG CGG CTG GTG CAG CGC GGG GAC CCG GCG GCT 

50 SC 
phe arg ala leu val ala gin cys leu val cys val pro trp asp 
TTC CGC GCG CTG GTG GCC CAG TGC CTG GTG TGC GTG CCC TGG GAC 

70 

ala arg pro pro pro ala ala pro ser phe arg gin val ser cys 
GCA CGG CCG CCC CCC GCC GCC CCC TCC TTC CGC CAG GTG TCC TGC 

80 90 
leu lys glu leu val ala arg val leu gin arg leu cys glu arg 
CTG AAG GAG CTG GTG GCC CGA GTG CTG CAG AGG CTG TGC GAG CGC 

100 

gly ala lys asn val leu ala phe gly phe ala leu leu asp gly 
GGC GCG AAG AAC GTG CTG GCC TTC GGC TTC GCG CTG CTG GAC GGG 

110 120 
ala arg gly gly pro pro glu ala phe thr thr ser val arg ser 
GCC CGC GGG GGC CCC CCC GAG GCC TTC ACC ACC AGC GTG CGC AGC 

A. 

( 

130 

tyr leu pro asn thr val thr asp ala leu arg gly ser glv ala 
TAC CTG CCC AAC ACG GTG ACC GAC GCA CTG CGG GGG AGC GGG GCG 
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trp gly leu 
TGG GGG CTG 



leu leu ala 
CTG CTG GCA 



ala tyr gin 
GCC TAC CAG 



thr gin ala 
ACT CAG GCC 



leu gly cys 
CTG GGA TGC 



val pro leu 
GTC CCC CTG 



ser ala ser 
AGT GCC AGC 



ala ala pro 
GCT GCC CCT 



ala his pro 
GCC CAC CCG 



140 

leu leu arg 
CTG CTG CGC 



arg cys ala 
CGC TGC GCG 

170 

val cys gly 
GTG TGC GGG 



arg pro pro 
CGG CCC CCG 

200 

glu arg ala 
GAA CGG GCC 



gly leu pro 
GGC CTG CCA 

230 

arg ser leu 
CGA AGT CTG 



glu pro glu 
GAG CCG GAG 

260 

gly arg thr 
GGC AGG ACG 



arg val gly 
CGC GTG GGC 

160 

leu phe val 
CTC TTT GTG 



pro pro leu 
CCG CCG CTG 

190 

pro his ala 
CCA CAC GCT 



trp asn his 
TGG AAC CAT 

220 

ala pro gly 
GCC CCG GGT 

pro leu pro 
CCG TTG CCC 

250 

arg thr pro 
CGG ACG CCC 



arg gly pro 
CGT GGA CCG 



asp asp val 
GAC GAC GTG 



leu val ala 
CTG GTG GCT 



tyr gin leu 
TAC CAG CTC 



ser gly pro 
AGT GGA CCC 



ser val arg 
AGC GTC AGG 



ala arg arg 
GCG AGG AGG 



lys arg pro 
AAG AGG CCC 



val gly gin 
GTT GGG CAG 



ser asp arg 
AGT GAC CGT 



150 
leu val his 
CTG GTT CAC 



pro ser cys 
CCC AGC TGC 

180 

gly ala ala 
GGC GCT GCC 



arg arg arg 
CGA AGG CGT 

210 
glu ala gly 
GAG GCC GGG 



arg gly gly 
CGC GGG GGC 

240 

arg arg gly 
AGG CGT GGC 



gly ser trp 
GGG TCC TGG 

270 
gly phe cys 

GGT TTC TGT 



230 

val val ser pro ala arg pro ala glu glu ala thr ser leu glu 
GTG GTG TCA CCT GCC AGA CCC GCC GAA GAA GCC ACC TCT TTG GAG 
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290 300 

gly ala leu ser gly thr arg his ser his pro ser val gly arg 

GGT GCC- CTC TCT GGC ACG CGC CAC TCC CAC CCA TCC GTG GGC CGC 

310 

gin his his ala gly pro pro ser thr ser arg pro pro arg pro 

CAG CAC CAC GCG GGC CCC CCA TCC ACA TCG CGG CCA CCA CGT CCC 

320 330 

crp asp thr pro cys pro pro val tyr ala glu thr lys his phe 

TGG GAC ACG CCT TGT CCC CCG GTG TAC GCC GAG ACC AAG CAC TTC 

340 

leu tyr ser ser gly asp lys glu gin leu arg pro ser phe leu 

CTC TAC TCC TCA GGC GAC AAG GAG CAG CTG CGG CCC TCC TTC CTA 

350 360 

leu ser ser leu arg pro ser leu thr gly ala arg arg leu val 

CTC AGC TCT CTG AGG CCC AGC CTG ACT GGC GCT CGG AGG CTC GTG 

370 

glu thr iie phe leu gly ser arg pro trp met pro gly thr pro 

GAG ACC ATC TTT CTG GGT TCC AGG CCC TGG ATG CCA GGG ACT CCC 

380 390 

arg arg leu pro arg leu pro gin arg tyr trp gin met arg pro 

CGC AGG TTG CCC CGC CTG CCC CAG CGC TAC TGG CAA ATG CGG CCC 

400 

leu phe leu glu leu leu gly asn his ala gin cys pro tyr gly 

CTG TTT CTG GAG CTG CTT GGG AAC CAC GCG CAG TGC CCC TAC GGG 

410 420 

val leu leu lys thr his cys pro leu arg aia aia val thr pro 

GTG CTC CTC AAG ACG CAC TGC CCG CTG CGA GCT GCG GTC ACC CCA 

430 

ala ala gly val cys ala arg glu lys pro gin gly ser val ala 

GCA GCC GGT GTC TGT GCC CGG GAG AAG CCC CAG GGC TCT GTG GCG 
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440 

aia pre glu glu glu asp thr 
GCC CCC GAG GAG GAG GAC ACA 



leu arg gin his ser ser pro 
CTC CGC CAG CAC AGC AGC CCC 

470 

ala cys leu arg arg leu val 
GCC TGC CTG CGC CGG CTG GTG 

his asn glu arg arg phe leu 
CAC AAC GAA CGC CGC TTC CTC 

500 

Isu giy iys his ala lys leu 
CTG GGG AAG CAT GCC AAG CTC 



met ser val arg asp cys ala 
ATG AGC GTG CGG GAC TGC GCT 

530 

gly cys val pro ala ala glu 
GGC TGT GTT CCG GCC GCA GAG 



ala lys phe leu his trp leu 
GCC AAC- TTC CTG CAC TGG CTG 

560 

leu arg ser phe phe tyr val 
CTC AGG TCT TTC TTT TAT GTC 

arg leu phe phe tyr arg pro 
AGG CTC TTT TTC TAC CGG CCG 

590 

ile gly ile arg gin his leu 
ATT GGA ATC AGA CAG CAC TTG 



450 

asp pro arg arg leu val gin leu 
GAC CCC CGT CGC CTG GTG CAG CTG 

460 

trp gin val tyr gly phe val arg 
TGG CAG GTG TAC GGC TTC GTG CGG 

480 

pro pro gly leu trp gly ser arg 
CCC CCA GGC CTC TGG GGC TCC AGG 

490 

arg asn thr lys lys phe ile ser 
AGG AAC ACC AAG AAG TTC ATC TCC 

510 

ser leu gin glu leu thr trp lys 
TCG CTG CAG GAG CTG ACG TGG AAG 

520 

trp leu arg arg ser pro gly val 
TGG CTG CGC AGG AGC CCA GGG GTT 

540 

his arg leu arg glu glu ile leu 
CAC CGT CTG CGT GAG GAG ATC CTG 

550 

met ser val tyr val val glu leu 
ATG AGT GTG TAC GTC GTC GAG CTG 

570 

thr glu thr thr phe gin lys asn 
ACG GAG ACC ACG TTT CAA AAG AAC 

580 

ser val trp ser lys leu gin ser 
AGT GTC TGG AGC AAG TTG CAA AGC 

600 

lys arg val gin leu arg glu leu 
AAG AGG GTG CAG CTG CGG GAG CTG 
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ser glu ala glu val 
TCG GAA GCA GAG GTC 

620 

leu thr ser arg leu 
CTG ACG TCC AGA CTC 



pro ile val asn met 
CCG ATT GTG AAC ATG 

650 

arg glu lys arg ala 
AGA GAA AAG AGG GCC 



phe ser val leu asn 
TTC AGC GTG CTC AAC 

680 

gly ala ser val leu 
GGC GCC TCT GTG CTG 



thr phe val leu arg 
ACC TTC GTG CTG CGT 

710 

tyr phe val lys val 
TAC TTT GTC AAG GTG 



gin asp arg leu thr 
CAG GAC AGG CTC ACG 

740 

asn thr tyr cys val 
AAC ACG TAC TGC GTG 



his gly his val arg 
CAT GGG CAC GTC CGC 



610 

arg gin his arg glu 
AGG CAG CAT CGG GAA 

arg phe ile pro lys 
CGC TTC ATC CCC AAG 

640 

asp tyr val val giy 
GAC TAC GTC GTG GGA 

glu arg leu thr ser 
GAG CGT CTC ACC TCG 

670 

tyr glu arg ala arg 
TAC GAG CGG GCG CGG 



gly leu asp asp ile 
GGC CTG GAC GAT ATC 

700 

val arg ala gin asp 
GTG CGG GCC CAG GAC 

asp val thr gly ala 
GAT GTG ACG GGC GCG 

720 

glu val ile ala ser 
GAG GTC ATC GCC AGC 



arg arg tyr ala val 
CGT CGG TAT GCC GTG 

760 

lys ala phe lys ser 
AAG GCC TTC AAG AGC 



ala arg pro ala leu 
GCC AGG CCC GCC CTG 

630 

pro asp gly leu arg 
OCT GAC GGG CTG CGG 

ala arg thr phe arg 
GCC AGA ACG TTC CGC 

660 

arg val lys ala leu 
AGG GTG AAG GCA CTG 

arg pro gly leu leu 
CGC CCC GGC CTC CTG 

690 

his arg ala trp arg 
CAC AGG GCC TGG CGC 



pro pro pro glu leu 
CCG CCG CCT GAG CTG 

720 

tyr asp thr ile pro 
TAC GAC ACC ATC CCC 

ile ile lys pro gin 
ATC ATC AAA CCC CAG 

750 

val gin lys ala ala 
GTC CAG AAG GCC GCC 



his val ser thr leu 
CAC GTC TCT ACC TTG 
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770 "80 

thr asp leu gin pro tyr mei arg gin phe val ala his leu gin 

ACA GAC CTC CAG CCG TAC ATG CGA CAG TTC GTG GCT CAC CTG CAG 

790 

glu thr ser pro leu arg asp ala val val ile glu gin ser ser 

GAG ACC AGC CCG CTG AGG GAT GCC GTC GTC ATC GAG CAG AGC TCC 

600 . 810 

ser leu asn glu ala ser ser gly leu phe asp val phe leu arg 

TCC CTG AAT GAG GCC AGC AGT GGC CTC TTC GAC GTC TTC CTA CGC 



820 

phe met cys his his ala val arg ile arg gly lys ser tyr val 
TTC ATG TGC CAC CAC GCC GTG CGC ATC AGG GGC AAG TCC TAC GTC 



830 



840 



gin cys gin gly ile pro gin gly ser ile leu ser zhr leu leu 
CAG TGC CAG GGG ATC CCG CAG GGC TCC ATC CTC TCC ACG CTG CTC 



850 

cys ser ieu cys tyr gly asp met glu asn lys leu phe ala gly 
TGC AGC CTG TGC TAC GGC GAC ATG GAG AAC AAG CTG TTT GCG GGG 



860 870 

ile arg arg asp gly leu leu leu arg leu val asp asp phe leu 

ATT CC-G COG GAC GGG CTG CTC CTG CGT TTG GTG GAT GAT TTC TTG 

880 

leu val thr pro his leu thr his ala lys thr phe ieu arg thr 

TTG GTG ACA CCT CAC CTC ACC CAC GCG AAA ACC TTC CTC AGG ACC 

890 900 

leu val arg gly val pro glu tyr gly cys val val asn leu arg 

CTG GTC CGA GGT GTC CCT GAG TAT GGC TGC GTG GTG AAC TTG CGG 



lys thr val val asn phe pro 
AAG ACA GTG GTG AAC TTC CCT 

920 

thr ala phe val gin met pro 
ACG GCT TTT GTT CAG ATG CCG 



910 

val glu asp glu ala leu gly gly 
GTA GAA GAC GAG GCC CTG GGT GGC 

930 

ala his gly leu phe pro trp cys 
GCC CAC GGC CTA TTC CCC TGG TGC 
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gly leu leu leu asp 
GGC CTG CTG CTG GAT 

950 

ser ser tyr ala arg 
TCC AGC TAT GCC CGG 

arg gly phe lys ala 
CGC GGC TTC AAG GCT 

980 

val leu arg leu lys 
GTC TTG CGG CTG AAG 



asn ser leu gin thr 
AAC AGC CTC CAG ACG 

1010 

leu gin ala tyr arg 
CTG CAG GCG TAC AGG 

his gin gin val trp 
CAT CAG CAA GTT TGG 

1040 

ser asp thr ala ser 

TCT GAC ACG GCC TCC 



ala gly met ser leu 
GCA GGC- ATG TCG CTG 

1070 

ser glu ala val gin 
TCC GAG GCC GTG CAG 



leu thr arg his arg 
CTG ACT CGA CAC CGT 



940 

thr arg thr leu glu 
ACC CGG ACC CTG GAG 



thr ser ile arg ala 
ACC TCC ATC AGA GCC 

970 

gly arg asn met arg 
GGG AGG AAC ATG CGT 

cys his ser leu phe 
TGT CAC AGC CTG TTT 

1000 

val cys thr asn ile 

GTG TGC ACC AAC ATC 



phe his ala cys val 
TTT CAC GCA TGT GTG 

1030 

lys asn pro thr phe 
AAG AAC CCC ACA TTT 



leu cys tyr ser ile 

CTC TGC TAC TCC ATC 
1060 

gly ala lys gly ala 
GGG GCC AAG GGC GCC 



trp leu cys his gin 
TGG CTG TGC CAC CAA 

1090 

val thr tyr val pro 

GTC ACC TAC GTG CCA 



val gin ser asp tyr 
GTG CAG AGC GAC TAC 

960 

ser val thr phe asn 
AGT GTC ACC TTC AAC 

arg lys leu phe gly 
CGC AAA CTC TTT GGG 

990 

leu asp leu gin val 
CTG GAT TTG CAG GTG 



tyr lys ile leu leu 
TAC AAG ATC CTC CTG 

1020 

leu gin leu pro phe 
CTG CAG CTC CCA TTT 



phe leu arg val ile 
TTC CTG CGC GTC ATC 

1050 

leu lys ala lys asn 

CTG AAA GCC AAG AAC 



ala gly pro leu pro 
GCC GGC CCT CTG CCC 

1080 

ala phe leu leu lys 
GCA TTC CTG CTC AAG 



leu leu gly ser leu 
CTC CTG GGG TCA CTC 
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arg cn- ala cln thr gin leu ser arg lys leu pro gly thr thr 
AGG ACA GCC CAG ACG CAG CTG AGT CGG AAG CTC CCG GGG ACG ACG 

1120 

leu th- ala leu alu ala ala ala asn pro ala leu pro ser asp 
CTG AC? GCC CTG GAG GCC GCA GCC AAC CCG GCA CTG CCC TCA GAC 

1130 1132 
phe Ivs thr ile leu asp OP 

TTC AAG ACC ATC CTG GAC TGA TGGCCACCCGCCCACAC-CCAGGCCGAGAGCAGA 

CACCAC-CAGCCCTGTCACGCCGGGCTCTACGTCCCAGGGAGGGAGGGGCGGCCCACACCC 

AGGCCCGCACCGCTGGGAGTCTGAGGCCTGAGTGAGTGTTTGGCCGAGGCCTGCATGTCC 

GGCTGAAGGCTGAGTGTCCGGCTGAGGCCTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTC 

CAGCACACCTGCCGTCTTCACTTCCCCACAGGCTGGCGCTCGGCTCCACCCCAGGGCCAG 

CTTTTCYTCACCAGGAGCCCGGCTTCCACTCCCCACATAGGAATAGTCCATCCCCAGATT 

CGCCATTGTTCACCCYTCGCCCTGCCYTCCTTTGCCTTCCACCCCCACCATCCAGGTGGA 

GACCCTGAGAAGC-ACCCTGGC-AGCTCTGGGAATTTGGAGTGACCAAAGGTGTGCCCTGTA 

CACAGGCGAGGACCCTGCACCTGGATGGGGGTCCCTGTGGGTCAAATTGGGGGGAGGTGC 

TGTGGGAGTAAAATACTGAATATATGAGTTTTTCAXjTTTTGRAAAAAAAAAAAAAAAAAA 

AAAAAAAAAA 
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